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PREFACE 
TO THE THIRD EDITION 





The present (third) edition of this outstanding book by A. Ya. Khin- 
chin was undertaken by the State Press for Physics and Mathematics 
only after the death of its author. For that reason, the book is being 
reprinted without change except for brief remarks of a bibliographic 
nature, indicated by my initials (B. G.). 

Although it has been more than a quarter of a century since Khin- 
chin wrote this book, it still maintains its original freshness. It is, thus, 
not without reason that many editions of the book have been printed 
in various countries in the last decade. Furthermore, in connection 
with the development of new methods of computational technology, a 
natural interest has arisen in computational algorithms, including that 
of continued fractions. Along these lines, there is a useful monograph 
by A. N. Khovanskil, Priloshenie tsepnykh drobe: i ikh obobshchemi k 
voprosam priblizhennogo analisa (“The Application of Continued Frac- 
tions and Their Generalizations to Problems in Approximative Anal- 
ysis”), published in 1956. Although Khinchin had a different purpose, 
his book can still serve as an excellent introduction both to the study 
of the algorithms of continued fractions and to the profound and in- 
teresting problems of the measure theory of numbers. Khinchin de- 
voted much time and displayed much initiative in the development of 
this theory. To a considerable degree, the entire third chapter is the 
result of his investigations. 

I am hoping that this book will be widely read with the same fascina- 
tion with which many persons, one of them being the author of these 
lines, read it twenty-five years ago. 

B. V. GXEDENKO 


PREFACE 
TO THE SECOND EDITION 


This edition is a reprint of the first with no significant changes. No 
other monographs on continued fractions have appeared in Russian 
since publication of the first edition. Among the more general works 
on the theory of numbers that contain information on continued frac- 


tions, the works of D. A. Glava, V. A. Venkov, and I. V. Arnold may 
be mentioned. 


A. KHINCIIIN 
October 1949 


FROM THE PREFACE 
TO THE FIRST EDITION 





The theory of continued fractions deals with a special algorithm that 
is one of the most important tools in analysis, probability theory, me- 
chanics, and, especially, number theory. The purpose of the present 
elementary text is to acquaint the reader only with the so-called regular 
continued fractions, that is, those of the form 


ao + 


usually with the assumption that all the elements a; (i > 1), are posi- 
tive integers. This most important and, at the same time, most thor- 
oughly studied class of continued fractions is at the basis of almost all 
arithmetic and a good many analytic applications of the theory. 

I feel that an elementary monograph on the theory of continued 
fractions is necessary because this theory, which formerly was a part 
of the mathematical program at the intermediate level, has now been 
dropped from that program, and hence is no longer included in the 
new textbooks on elementary algebra. On the other hand, the curricula 
at the more advanced levels (even in the mathematics divisions of 
universities) also omit this theory. 

Since the basic purpose of this monograph is to fill the gap in our 
textbook literature, it necessarily had to be elementary and, to as 
great a degree as possible, accessible. Its style is in large measure de- 
termined by this fact. Its content, however, goes somewhat beyond the 
limits of that minimum absolutely necessary for any application. This 
remark applies chiefly to the entire last chapter, which contains the 
fundamentals of the measure (or probability) theory of continued 
fractions—an important new field developed almost entirely by Soviet 
mathematicians; it also applies to quite a number of items in the sec- 
ond chapter, where I attempted, to the extent possible in such an ele- 


ix 
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mentary framework, to emphasize the basic role of the apparatus of 
continued fractions in the study of the arithmetic nature of irrational 
numbers. I felt that if the fundamentals of the theory of continued 
fractions were going to be published in the form of a separate mono- 
graph, it would be a shame to leave unmentioned those highlights of 
the theory which are the subject of the greatest amount of contem- 
porary study. 

As regards the arrangement of the material, it need only be men- 
tioned that the “formal” part of the study is contained in a special 
preliminary chapter. In this chapter, the elements of the continued 
fractions are assumed to be arbitrary positive numbers (not necessarily 
integers) and often—even more generally—simply independent vari- 
ables. A drawback to such a separate presentation is the fact that the 
formal properties of the apparatus being studied are submitted to the 
reader before the subject matter itself and, therefore, are divorced 
from it. This is no doubt undesirable from a pedagogical standpoint. 

However, a greater methodological precision is to be attained by 
this approach (because the reader can see immediately which properties 
of continued fractions come from the very structure of the apparatus 
and which exist only under the assumption of positive integral ele- 
ments). Such a separate introductory exposition of the formal part of 
the study also makes possible the subsequent development of the 
arithmetic theory (which is the main theme of the study) on an already 
prepared formal base. Thus, the reader’s attention may be concen- 
trated on the content of the material being expounded, without divert- 
ing it for purely formal considerations. 

A. KaINCHIN 
Moscow 
February 12, 1935 
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Chapter I 


PROPERTIES OF THE 
APPARATUS 





1. Introduction 


An expression of the form 
l 
a + 7 (1) 
Bepe 


Qjgt... 





is called a regular or simple continued fraction. The letters do, a1, a, 
+--+ , in the most gencral treatment of the subject, denote independent 
variables. In particular cases, these variables may be allowed to take 
values only in certain specified domains. Thus, do, a1, @2, +++ may be 
assumed to be real or complex numbers, functions of one or several 
variables, and so on. For the purposes of the present book, we shall 
always assume 4&1, de, "° to be positive integers; ao may be an arbitrary 
real number. We shall call these numbers the elements of the given con- 
tinued fraction. The number of elements may be either finite or infinite. 
In the first case, we shall write the given continued fraction in the form 
l l 
cl a 
a, + — (2) 


An 


and call it a fnile continued fraction—more precisely an zth-order con- 
tinued fraction (so that an #zth-order continued fraction has n + i 
elements}; in the second case, we shall write the continued fraction in 
the form (1) and call it an izfinile continued fraction. 

Every finite continued fraction is the result of a finite number of ra- 
tional operations on its elements. Therefore, under our assumptions re- 
garding the elements, every finite continued fraction is equal to some 
real number. In particular, if all the elements are rational numbers, the 
fraction itself will be a rational number. On the other hand, we cannot 


I 
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immediately assign any numerical values to an infinite continued frac- 
tion. Until we adopt some convention, it is only a formal notation, 
similar to that for an infinite series whose convergence or divergence 
is not brought into question. Of course, it can, nonetheless, be the 
subject of mathematical investigations. 

Let us agree for reasons of technical convenience to write the in- 
ünite continued fraction (1) in the form 


[ao a A, ...], (3) 
and the finite continued fraction (2) in the form 
[age Bhi Ae san Aal (4) 


thus, the order of a finite continued fraction is equal to the number of 
svmbols (elements) after the semicolon. 
Let us agree to call the continued fraction 


Sp==[@q; Gy, Gy, «++, agh 


where O < k < n, a segment of the continued fraction (4). Similarly, 
for arbitrary k > 0, we shall call s, a segment of the infinite continued 
fraction (3). Obviously, any segment of any continued fraction (finite 
or infinite) is itself a finite continued fraction. Let us also agree to call 
the continued fraction 


Fe = (la; Opti eer Anl 


a remainder of the finite continued fraction (4). Similarly, we shall call 
the continued fraction 


Te = [0k Grane Span +++] 


a remainder of the infinite continued fraction (3). Obviously, all the 
remainders of a finite continued fraction are finite continued fractions 
and all the remainders of an infinite continued fraction are infinite 
continued fractions. 

lor inite continued fractions, it follows that 


[ao Qi, An, wey a,| 


== [2 @,, ag ..., apop 72] (OS RN). 


The analogous relationship 


(9) 


[aoi 2), Qo, ...J—=[ag} a), Gy, ..., Gp_y, Te] = (RO) 
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for infinite continued fractions can be meaningful only as a formal 
(trivial) notation since the element r; on the right side of this equation, 
being an infinite continued fraction, has no definite numerical value. 


2. Convergents 
Every finite continued fraction, 


[ao Ajs Ox sane aa] 


being the result of a finite number of rational operations on its ele- 
ments, is a rational function of these elements and, consequently, can 
be represented as the ratio of two polynomials 

P (ap ais... ap) 

Q (ao ar.. an) 
in do, 21, *** , Gn, With integral coefficients. If the elements have nu- 
merical values, the given continued fraction is then represented in 
the form of an ordinary fraction p/q. However, such a representation 
is, of course, not unique. For what follows, it will be important for us 
to have a definite representation of a finite continued fraction in the 
form of a simple fraction—a representation which we shall call canoni- 
cal. We shall defne such a representation by induction. 

For a zeroth-order continued fraction, 


[a] = Qy ? 


we take as our canonical representation the fraction &o/1. Suppose now 
that canonical representations are defined for continued fractions of 
order less than 2. By equation (5), an sth-order fraction 


l 
[2 ; Ai» ce se aal = (a: N= Gres 
Here, 


P= 10)3 Ga seen Bal 


is an (z — 1)st-order continued fraction, for which, consequently, the 
canonical representation is already defined. Let us represent it as 


then, 
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We shall take this last fraction as our canonical representation of the 
continued fraction [ao; a), ++" , an). Thus, by setting 


[ap ap eee A.) 


r= la; Qo, oe 4 a 
we have the following expressions for the numerators and denominators 
of these canonical representations: 


p=ayp'-+q', q= p. (6) 


Thus, we have uniquely defined canonical representations of continued 
fractions of all orders. 

In the theory of continued fractions, an especially important role is 
plaved by the canonical representations of the segments of a given 
(finite or infinite) continued fraction a = [do; @1, d2, °-+]. We shall 
denote by p/q; the canonical representation of the segment 


Sp = [ay @,, Ag, - + es a,l 


of the continued fraction, and we shall call it the kth-order convergent 
(or approximant) of the continued fraction a. This concept is defined 
in exactly the same way for finite and infinite continued fractions. The 
only difference is that a finite continued fraction has a finite number 
of convergents, whereas an infinite continued fraction has an infinite 
number of them. Ior an zth-order continued fraction a, obviously 


Pa 
dn 


= 1; 


such a continued fraction has n + 1 convergents (of orders, 0, 1, 2, 
ove ; n). 
THEOREM 1 (the rule for the formation of the convergents). For arbi- 
lrary k > 2, 
Pr = AkPr-1 F Pr_o 
(7) 


Ir = 8 dp-1 + Ip -2- 


Proor. In the case of k = 2, the formulas in (7) are easily verified 
directly. Let us suppose that they are true for all $ < n. Let us then 
consider the continued fraction 


(A i3 05s. 444505] 
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and let us denote by p’,/q’, its rth-order convergent. On the basis of 
the formulas in (6), 


Pa = APh- a ee 


/ 


Gn Pest: 
And since, by hypothesis, 


/ a 7 
Pa-1 ~ A aPn-o aa eae 
Po , t 
Ps te aaan- as Gn-3 
(here, we have a, rather than an-ı because the fraction fa); a2, +++ , an] 
begins with a; and not with ao), it follows on the basis of (6) that 


7 f 
Pa Z adi (2,,P_—2 + P-a) aa CA +9,_3) 
NA 7 / r 
— a, (2,P,-2 +4, 9) + (aa Pa -3 +F gaa) 
=4nPa-) Pism 
f f 
Irn T Q Pa? a P, i~ aafn-1 Tiie 
which completes the proof. 
These recursion formulas (7), which express the numerator and de- 
nominator of an nth-order convergent in terms of the element an and 
the numerators and denominators of the two preceding convergents, 


serve as the formal basis of the entire theory of continucd fractions. 
REMARK. It is sometimes convenient to consider a convergent of 


order —1; in this case, we set p-1 = 1 and g_1 = 0. Obviously, with 
this convention (and only then), the formulas of (7) retain their valid- 
ity for k = 1. 
THEOREM 2. For all k > 0, 
IePr-1 — PrIn-1 = (— IF. (8) 


Proof. Multiplying the first formula of (7) by gx—-1 and the second 
by pr-ı and then subtracting the first from the second, we obtain 


IkPr-1 — Phar- = — Ge-1Pr—-2 — Pr-1%k-2)) 
and since 


goP-1— Pol -1 = l» 


the theorem is proved. 
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COROLLARY. For all k = 1, 


k 
Pra _ Pe COCV l (9) 


TuCorRM 3. For alil k > 1, 
k-1 
QkPr-2 — Prlr-2 = (— 1) ag 


Proor. By multiplying the first formula of (7) by qs— and the sec- 
ond by p,— and then subtracting the first from the second, we obtain, 
on the basis of Theorem 2, 


k-] 
IkPr-2— Pralr-2 = Fn (Vp—1Pp-2 — Pr-14k -2) = (— 1) aps 


which completes the proof. 
COROLLARY. For all k = 2, 


Ge ge T (10) 


The simple results that we have just obtained make it easy for us 
to reach certain very important conclusions concerning the relative 
values of the convergents of a given continued fraction. Specifically, 
(10) shows that the convergents of even order form an increasing se- 
quence and that those of odd order form a decreasing sequence. Thus, 
these two sequences tend toward each other (all this under our assump- 
tion that the elements from a on are positive). Since, by (9), every 
odd-order convergent is greater than the immediately following even- 
order convergent, it follows that every odd-order convergent is greater 
than any even-order convergent. Therefore, we may draw the follow- 
ing conclusions. 

THEOREM 4. Even-order convergents form an increasing and odd-order 
convergents a decreasing sequence. lso, every odd-order convergent is 
greater than any even-order convergent. 

[tis particularly evident that, for a finite continued fraction a, every 
cvcn-order convergent is less than a and every odd-order convergent 
Is greater Lhan a (except, of course, the last convergent, which is 
equal to a). 

We conclude this section with the proof of two simple, but extremely 
important, propositions concerning the numerators and denominators 
of the convergents. 
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THEOREM 5. For arbitrarvk (1 < k < n), 


Py p T Pe-o 


Ga, Gy. Gay aso, 4| = ; 
[ 0 1 2 nl dpe Fipo (11) 


(Here, pi, qi, r: refer to the continued fraction on the left side of this 
equation.) 
Proof. From (5), 


[a Gy, Ay, +, apl = [ao ap Ay, n, Apps Tel 


The continued fraction on the right side of this equation has as a 
(k — 1)st-order convergent the fraction ps—1/g;—-1. Its kth-order con- 
vergent, x/qx, is equal to the fraction itself; and since from (7) 


Prk = Pr- k Pr- Qr = r-k FIr- 
the theorem is proved. 


THEOREM 6. For arbitrary k È 1, 





Gp 


Proor. For k = 1, this relationship is obvious because it is of the 
form 





qı 
— =. 
qo i 
Suppose that k > 1 and that 
d- 
1, + = [akn ak- eee) G). (12) 
k-2 


On the basis of the equations in (7), 


Tp = U9 p-1 + Te-2 
and we have 





Therefore, from formulas (5) and (12), 
dg 

Vp) 

which completes the proof. 





= [Gg App es A), 
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3. Infinite continued fractions 
To every infinite continued fraction 
[Gps Gis Boe soa); (13) 


there corresponds an infinite sequence of convergents 


P P 

aR A EO te ats (14) 

% 4, Tr 
Every convergent is some real number. If the sequence (14) converges, 
that is, if it has a unique limit a, it is natural to consider this number a 
as the “value” of the continued fraction (13) and to write 


a = [a a); Qo, tals 


The continued fraction (13) itself is then said to converge. If the se- 
quence (14) does not have a definite limit, we say that the continued 
fraction (13) diverges. 

In many of their properties, convergent infinite continued fractions 
are analogous to finite continued fractions. The basic property which 
makes possible the further extension of this analogy is expressed by the 
following theorem. 

THEOREM 7. If the infinite continued fraction (13) converges, so do all 
of its remainders; conversely, if at least one of the remainders of the con- 
tinued fraction (13) converges, the continued fraction itself converges. 

Proor. Let us agree to denote by p,/q; the convergents of a given 
continued fraction (13), and by p’4/q’; the convergents of any one of 
ils remainders, for example, rn. From formula (11), we have 

/ 


Pr 
Prat = Piss 


1045, 9,202 ggg] =——-- (220, 10.0) (0S) 


Pp 
9n-1 + + 4n-2 
dgk 


Pn- R 
n-k 


It follows immediately that if the remainder r, converges, that is, if 

as k— œ the fraction p’,/g’, approaches a limit which we shall also 

denote by ra, then the fraction pa+Ł/qn+x will converge to a limit a 

cqual to 1 
Pn-i'n T Pn-2 

=> fae nV east, 1 
Gn-i a + 4n-2 (16) 


sy solving (15) for p'+/q'rk, we establish the validity of the converse, 
thus completing the proof of the theorem. 


g. 
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We note that formula (16), which we have just established for con- 
vergent infinite continued fractions, is exactly analogous to formula 
(11), which we proved earlier for finite continued fractions. Similarly, 
the theorem analogous to Theorem 5 holds for infinite continued frac- 
tions. 

The following propositions for convergent infinite continued frac- 
tions follow directly from Theorem 4 of the preceding section. 

THEOREM 8. The value of a convergent infinite continued fraction is 
greater than any of ils even-order convergents and is less than any of its 
odd-order convergents. 

Furthermore, on the basis of this theorem, the corollary to Theo- 
rem 2 of the preceding section implies the following result, which plays 
a basic role in the arithmetic applications of the theory of continued 
fractions. 

THEOREM 9. The value a of the convergent infinite continued fraction 
(13) for arbitrary k > O satisfies the inequality 


l 








Obviously, Theorem 9 is also valid for the finite continued fraction 


a = [a0 a, Ay, ..-, aal, 


for all k < n, except that, for the single case of k = n — 1, the in- 
equality must be replaced by equality, since a = pn/qn. If a is the value 
of a convergent infinite continued fraction (13), we shall also refer to 
the elements of that continued fraction as the elements of the number a. 
Similarly, we shall refer to the convergents, segments, and remainders 
of the continued fraction (13) as the convergents, segments, and re- 
mainders, respectively, of the number a. On the basis of Theorem 7, all 
the remainders of a convergent infinite continued fraction (13) have 
definite real values. 

The question naturally arises as to whether there are tests for the 
convergence of continued fractions, just as for infinite series. In the 
case with which we are concerned, that is, when a; > 0, for all 7 > 1, 
there exists an extremely simple and convenient test for convergence. 


1 We note that, under our assumptions, gẹ > 0, for all & > O (since go = 1 and 
qı = a, we can show by induction from the second part of eq. [7] that gx > 0, 
for all $ > 1). 
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Trrorem 10. For the continued fraction (13) to converge, if is neces- 
sary and suthcient that the series 


Den (17) 
n=1 


diverge. 

PROOF. It clearly follows from Theorem 4 that a necessary and suff- 
cient condition for the convergence of an infinite continued fraction is 
that the two sequences referred to in that theorem have the same limit. 
(Theorem + clearly implics that each of these sequences has a limit.) 
And, as formula (9) shows, this is the case if and only if . 


dlr >œ as k>. (18) 


Thus, condition (18) is necessary and sufficient for the convergence of 
a given continued fraction. 

Suppose that the series (17) converges. From the second formula 
of (7), 


Ir > dr-2 (k > |). 
Therefore, for arbitrary k, we have either q > ge—1 Or gr—1 > qk—2- 
In the first case, the second formula of (7) yields 
Ir < Okr Fap- 


and therefore, for sufficiently large k (when a; < 1, which, because 
of the convergence of the series in eq. [17], must be the case for k > ko), 
we have 





Tp 
< TE 


In the second case, the same formula gives, for a, < 1, 


Ir- 
l — ap f 





I, < Cl +a,) Je- Í 
Thus, for all k > ko, we have 
1 
Ir < Ia e 


where 1 < k. IFI > ko, we may apply the same inequality to q:. 
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By continuing this reasoning, we arrive at the inequality 


qs 
Te < (1 — ap) (1 — a)... (1—a,)’ (19) 


where k >l > +++ >r> k and s< ko. But, because of the as- 
sumed convergence of the series in (17), the infinite product 


co 


Üe -e 


as we know, converges: that is, it has a positive value, which we denote 
by A. Obviously, 


R 
(ea) asa), (1—a,) > [Jd —a,) =. 


Therefore, if we denote by Q the largest of the numbers go, qi, «°° 
gko—1, We conclude from inequality (19) that 


<T Ek 
consequently, 0? 
Witte <x (R> ko) 


and the relationship in (18) cannot hold. Therefore, the given continued 
fraction diverges. 

Conversely, suppose that the series in (17) diverges. Since gx > qr—?, 
for all $ > 2, if we denote by c the smallest of the numbers go, qi, we 
have k > 0, for arbitrary qx > c. Therefore, the second formula of (7) 
gives us 

Ge=Te-2 +00, (k>2). 


Successive application of this inequality gives us 


k 
Jorn > Ig + C È Orn 
and 
= 
Joti an +e 3 o>, +) 
so that 


2k+1 


don Faart > Yo FM T° pa Ons 
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in other words, for all k, 
k 
Tp + gk- > € Èi ap 


We have already proved this inequality for odd values of k, and it 
can obviously be established for even values of k by the same method. 
It then follows that at least one of the factors in the product gzgz-1 
exceeds 4cXf_1@,, and since the other factor is never less than c, we 


have k 
e NN 
VeIk-1 > T 2 an: 


n=l 


Because of the assumed divergence of the series in (17), this implies 
relationship (18) and, consequently, the convergence of the given con- 
tinued fraction. This completes the proof of Theorem 10. 


4, Continued fractions with natural elements 


From this point until the end of the book, we shall assume that ele- 
ments @), €z, ++» are natural numbers, that is, positive integers, and 
that @ is an integer, though not necessarily positive. If such a con- 
tinued fraction is infinite, Theorem 10 ensures its convergence. There- 
fore, we can henceforth freely assume that any continued fraction that 
we are dealing with is convergent, and we can speak of its “value”. 
If such a continued fraction is finite, and if its last element (an) is 1, it 
is evident that r,-1 = @,_1 + 1 is an integer. Therefore, in this case, 
we can write the given #th-order continued fraction [@o; a1, de, «°°, 
d,—1, 1] in the form of an (n — 1)st-order continued fraction [@9; 41, a2, 
"++ @n—ı + 1]; in this new form, the last element is clearly greater 
than unity. 

Because of this ract, in all that follows we can exclude from consider- 
ation finite continued fractions whose last elements are equal to unity 
(except, of course, for the zeroth-order fraction [1]). This plays an im- 
portant role in the question of the uniqueness of the representation 
of numbers by continued fractions (see Chap. II, sec. 5). 

Obviously, the numerators and denominators of the convergents, in 
the case now under consideration, are integers. (For p-1, ¢-1, po, 
and go, this can be seen immediately, and for the numerators and de- 
nominators of the remaining convergents, it follows from the formulas 
in eq. [7].) Furthermore, we have the following very important propo- 
sition. 

THEOREM 11. All convergents are irreducible. 
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The proof follows immediately from formula (8), since any common 
divisor of the numbers p, and g, would at the same time be a divisor 
of the expression gnpa—1 — PaGn—l- 

The second formula of (7) shows that q; > qr-ı, for every k > 2. 
Therefore, the sequence 


di» Go oo ey ps ees 
is always increasing. We have a much stronger proposition concerning 


the rate of increase of the numbers gz. 
THEOREM 12. For arbitrary? k > 2, 


ProorF. For k = 2, 
qr = FG p-1 F Ir-2 > Ir-1 IV p-2 > Ar- 
Successive application of this inequality yields 
qor > 2¥qy= 2", ope > 2 


which proves the theorem. Thus, the denominators of the convergents 
increase at least as rapidly as the terms of a geometric progression. 

Intermediate fractions —Suppose that k > 2 and that 7 is an arbi- 
trary negative integer. The difference 


Pp CEU Py _ Prosi T Peo 
da HDH apa HI po 


which, as is easily seen, is equal to 


(—1)* 
(7p,-.C FD AG, ol [pe _yh + Mpa] 


has the same sign for all 7 > 0, depending only on whether & is even 
or odd. It follows from this that the fractions 


Pg- Pp_o F PR) Pe-o T ?Pp-j Pp- F kPr-i__ Pe (20) 
eS eS coi \ 


Ce 


Ir- Pe Fg- Ir- fag- es Tr 


2 Here, and in all that follows, in the case of a finite continued fraction only those 
values of & for which gs is meaningful are to be considered. 
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form, for even &, an increasing and, for odd k, a decreasing sequence 
(see Theor. +). The first and last terms of this sequence are either both 
even- or both odd-order convergents. The intervening terms (if there 
are any, that is, if a, > 1), we shall call intermediate fractions. In arith- 
metic applications, these intermediate fractions play an important 
role (though not as important a role as the convergents). To make 
their mutual disposition and the law of their progressive formation 
clearer, it is convenient to introduce the concept of the so-called 
mediant of two fractions. 

The mediant of two fractions a/b and c/d, with positive denomina- 
tors, is the fraction 


a+c 
b+d° 





LEMMA. The mediant of two fractions always lies between them in value. 

Proor. Suppose, for definiteness, that a/b < c/d. Then, bc — 
ad > 0, and, consequently, 

ate a 6c—ad a+c c _ ad—be 

b+d b b(b+d) > 0, btd d` b(b+d) <0, 


which proves the lemma. 

We see immediately that each of the intermediate fractions in the 
progression of (20) is the mediant of the preceding fraction and the 
fraction px_i/gx_1. By going through progression (20) and successively 
forming the mediants, we proceed from the convergents pr—2/qx—2 In 
the direction of the convergents px—i/gi—1. The concluding step in 
this sequence will occur when the mediant constructed coincides with 
pi/ge. This last fraction lies between p,-1/gx-1 and pr-2/qk-2, as we 
know from Theorem +. We also know that the value a of the given 
continued fraction lies between pr—1/qr-1 and pr/qr, and that the 
fractions px—92/gx—2 and px/gx, which are cither both of even order or 
both of odd order, lie on the same side of the number a. It follows from 
this that the entire progression in (20) lies on one side of the number a 
and that the fraction pr—1/gr—ı lies on the other side. In particular, the 
fractions (pe-1 + pro)/(Ge-1 + qr—) and px—i/qx_1 are always on 
opposite sides of a. In other words, the value of a continued fraction 
always lies between an arbitrary convergent and the mediant of that con- 
tergent and the preceding one. (We suggest that the reader make a draw- 
ing to illustrate the relative positions of all these numbers.) 

This remark indicates a method whereby, if we know the conver- 
gents pr_o/gz—2 and pr_i/gz_1, We Can construct the subsequent con- 
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vergent z/g without knowing the element a, (but using our knowl- 
edge of the value a of the continued fraction). Specifically, we first 
take the mediant of the two given fractions, then the mediant of this 
mediant and px_1/qx-1, and so on, each time taking the mediant of the 
mediant just obtained and the fraction p;—1/qr-1. We already know 
that these consecutive mediants will initially approximate a. The last 
mediant of this progression that lies on the same side of a as does the 
initial fractions pr_o/Qr—2 is px/ qu. For, as we already know, px/q; lies 
somewhere among the mediants in the progression, and on the same 
side of a as px—2/gx—2. Therefore, it only remains for us to show that 
the subsequent mediant will lie on the opposite side of a. But the last 
mediant is (p + ps-1)/(ge + qr-1) and, on the basis of the remark 
made above, it does indeed lie on the opposite side of the number a. 
There is another even more important consequence of the relative 
positions of a number a, and its convergents and intermediate frac- 
tions. The intermediate fraction (pe + peyi)/(ge + qei), since it is 
between px/gx and a, lies closer to p/q; than does a; that is, 


Pet Peo Pelt 
ree Tp Tkp Fika) 











(Equality is impossible here because this would indicate that 
Pe T Piri Pro oq 
(gi deg ee 
Re kol R12 

that is, that a would be a finite continued fraction with last element 
equal to unity, which we excluded from consideration in the beginning.) 

Thus, we arrive at the following important result. 

THEOREM 13. For all k = 0, 
Pr 


g — — 


Tp 


l 


ante a 21 
Ty (Ina. TIe) (=a 








This inequality, which gives a lower bound for the difference 
|a — (px/qx)|, supplements the inequality exhibited in Theorem 9, 
which provides an upper bound for the same difference. 


Chapter II 


THE REPRESENTATION OF 
NUMBERS BY CONTINUED 
FRACTIONS 





5. Continued fractions as an apparatus for 

representing real numbers 
THEOREM 14. To every real number a, there corresponds a unique con- 
tinued fraction with value equal to a. This fraction is finite tf a is rational 
and infinite if a ts trrational.' 

Proor. We denote by ao the largest integer not exceeding a. If a is 
not an integer, the relation 


1 
a= Ay + 7 (22) 
ry 
allows us to determine the number rı. Here, clearly, rı > 1, since 


l 


In gencral, if 7, is not an integer, we denote by a, the largest integer 
not exceeding r, and define the number 7,41 by the relation 


a +} La (23) 


Fa+t 





This procedure can be continued as long as r, is not an integer; here, 
clearly, 7, > 1 (2 > 1). 
Equation (22) shows that 
a = [a ril. 


Suppose that, in general, 
2G a Woy abny. Wp cee Mls (24) 


! We remind the reader that we are considering continued fractions with integral 
clements, that a; > 0 for7 > 1, and that the last element of every finite continued 
fraction must be different from unity. 


16 
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Then, from equations (5) and (23), we have 


a = [ag; Air Ay, wens Qn-\) Gy, Faih 
thus, (24) is valid for all z (assuming, of course, that ri, re, +*+ , 7ni 
are not integers). 

If the number a is rational, all the r, will clearly be rational. It is 
easy to see that, in this case, our process will stop after a finite number 
of steps. If, for example, ra = a/b, then 


faa ees Sa 


where c < b, since rz — dn < 1. Equation (23) then gives 


a 
c 


PES 


(provided ¢ is not equal to zero, that is, if ra is not an integer; if r, is 
an integer, our assertion is already satisfied). Thus, 7241 has a smaller 
denominator than does 7,. It follows from this that if we consider rı, r2, 

- , we must eventually come to an integer rn = än. But, in this case, 
(24) asserts that the number a is represented by a finite continued 
fraction, the last element of which is a, = 7, > 1. 

If a is irrational, then all the r, are irrational and our process is in- 
finite. Setting 


[Q; ay, Qo, om os, a, = 


(where the fraction pa/Qn is irreducible and g, > 0), we have, by (24) 
and (16) of Chapter I, 


— Pa-i nt Pn-2 (n > 2). 
Gn-\' nT In-2 a 
On the other hand, it is obvious that 


Pn _ Pn-10n F Pn-2 
qn Gn-18n + Jn-2 


a 


so that 
eae <i a (Pn—19n-2 — In-1Pn—2) (fn — an) 
In (Jn-1"n + In-2) (Gn—14n F In—-2) 


and, consequently, 


sa 


l 
a E eee | 
Gn 4 ,- vn F 1a-2) Ce În T 7-9) In 
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Thus, 
Pr 


n 


— a as n —> 00; 


but this means that the infinite continued fraction [&o; a1, a2, ++] has 
as its value the given number a. 

Thus, we have shown that any number a can be represented by a 
continued fraction; this fraction is finite if a is rational and infinite if a 
is irrational. It remains for us to show the uniqueness of the expansions 
that we have obtained. We note first that uniqueness follows essentially 
from the considerations of section 4, Chapter I, where we saw that once 
we know the value of a given continued fraction we can effectively con- 
struct all its convergents and hence all its elements. However, the re- 
quired uniqueness can be established in a much simpler manner. Sup- 
pose that 


a = [Q); Qi, Qo, aoa |= [a a; a, > el 


where the two continued fractions may be either finite or infinite. Let 
us denote by [x] the largest integer not exceeding x. First of all, it is 
obvious that ao = [a] and a'o = [a], so that ao = a'o. Furthermore, if 
it is established that 


a = a; (i =0, 1, 2,..., n), 
then, in analogous notation, 
Pi 7 P; 
; (£=20; l, 2, , n), 
q,= 9; 


and, on the basis of formula (16) of Chapter I, 


for ? t 
Priv n+ + Pn- = Pa’ nsi F Pn-1 Prvaz + Pn- 
SSS O 
Gal n+} T Jn- Vil nel T In—1 Gn'n] T In—1 


so that, rag. = Magi. Since angi = [7n41] and a’n41 = [rnga], we have 
nti = Ungi: that is, the two fractions coincide completely. 

We note that the above argument would be impossible if we admit- 
ted finite continued fractions with the last element equal to unity; if, 
for example, @,41 = 1 were such a last element, we would have 7, = 
ad, + 1 anda, Æ [rn]. 

We have just shown that real numbers are uniquely represented by 
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continued fractions. The basic significance of such a representation 
consists, of course, in the fact that, knowing the continued fraction 
that represents a real number, we can determine the value of that num- 
ber with an arbitrary prestated degree of accuracy. Therefore, the ap- 
paratus of continued fractions can, at least in principle, claim a role 
in the representation of real numbers similar to that, for example, of 
decimal or of systematic fractions (that is, fractions constructed ac- 
cording to some system of calculation). 

What are the basic advantages and shortcomings of continued frac- 
tions as a means of representing the real numbers in comparison with 
the much more widely used systematic representation? To answer this 
question, we need first to have a clear picture of the demands that may 
and should be made of such a representation. Clearly, the first and 
basic theoretical demand should be that the apparatus reflect as much 
as possible the properties of the number that it represents, so that these 
properties may be brought out as completely and as simply as possible 
each time that the representation of the number by this apparatus 
is given. 

With respect to this first demand, continued fractions have an un- 
deniable and considerable advantage over systematic (and, in par- 
ticular, decimal) fractions. We shall gradually see this during the 
course of the present chapter. To a degree, in fact, this is clear even 
from a priori considerations. Since a systematic fraction is connected 
with a certain system of calculation, it therefore unavoidably rellects, 
not so much the absolute properties of the number that it represents, 
as its relationship to that particular system of calculation. Continued 
fractions, on the other hand, are not connected with any system of 
calculation; they reproduce in a pure form the properties of the num- 
ber that they represent. Thus, we have already seen that the ration- 
ality or irrationality of the number represented finds complete ex- 
pression in the finiteness or infiniteness of the continued fraction 
corresponding to it. As we know, for systematic fractions the corre- 
sponding test is considerably more complicated: the finiteness or in- 
finiteness of the representing fraction depends not just on the number 
represented but also, in a very real way, on its relationship to the 
chosen system of calculation. 

However, besides the basic theoretical demands that we have men- 
tioned, certain demands of a practical nature should naturally be made 
for any apparatus that is used to represent numbers. (Some of these 
practical considerations may also have certain theoretical value.) Thus, 
it is of great importance that the apparatus make it possible and rea- 
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sonably easy to find values that approximate the represented number 
with any arbitrary degree of accuracy. The apparatus of continued 
fractions satisfies this demand to a very high degree (and, in any case, 
better than does the apparatus of systematic fractions). In fact, we 
shall soon see that the approximating values given by continued frac- 
tions have, in a certain extremely simple and important sense, the 
property of being the bes! approximations. 

There is, however, another and yet more significant practical de- 
mand that the apparatus of continued fractions does not satisfy at all. 
Knowing the representations of several numbers, we would like to be 
able, with relative ease, to find the representations of the simpler func- 
tions of these numbers (especially, their sum and product). In brief, 
for an apparatus to be suitable from a practical standpoint, it must ad- 
mit sufficiently simple rules for arithmetical operations; otherwise, it 
cannot serve as a tool for calculation. We know how convenient sys- 
tematic fractions are in this respect. On the other hand, for continued 
fractions there are no practically applicable rules for arithmetical oper- 
ations; even the problem of finding the continued fraction for a sum 
from the continued fraction representing the addends is exceedingly 
complicated, and unworkable in computational practice. 

The advantages and shortcomings of continued fractions as com- 
pared with systematic fractions determine (to a great extent) the areas 
of application of these two representations. Whereas, in computation, 
systematic fractions are used almost exclusively, the apparatus of con- 
tinued fractions finds its primary application in theoretical investiga- 
tions involving the study of the arithmetic laws of the continuum and 
the arithmetic properties of individual irrational numbers. The appa- 
ratus of continued fractions is an irreplaceable tool for theoretical in- 
vesligations, and the prime purpose of all that follows will be its appli- 
cation to that purpose. 


6. Convergents as best approximations 


To represent an irrational number a as an ordinary rational fraction 
(to within a specified margin of accuracy), it is natural to use the con- 
vergents of the continued fraction representing a. The degree of ac- 
curacy of this approximation is given by Theorems 9 and 13 of Chap- 
ter I. Specifically, we have 


1 | a Pn 


l 
In lIn F an+) HS Vn 


> GnIn+i : 
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The problem of approximating irrational numbers by rational frac- 
tions consists, in its simplest form, of determining which of the fractions 
that differ from the given irrational number by not more than a speci- 
fied amount has the lowest (positive) denominator. The problem (stat- 
ed in this manner) can be meaningful even in the case in which the 
number a is rational. For example, if a is a fraction with an extremely 
large numerator and denominator, we may want to approximate this 
number by a fraction whose numerator and denominator are smaller. 
From a purely practical point of view, there is no real difference be- 
tween these two cases (rational and irrational a), since, in practice, 
every number is given with only a certain degrce of accuracy. 

It is immediately clear that the apparatus of systematic fractions is 
completely unsuitable for solving this problem, since the denominators 
of the approximating fraction that it provides are determined exclusive- 
ly by the chosen system of calculation (in the case of decimal fractions, 
they are powers of ten); hence, the denominators are completely inde- 
pendent of the arithmetic nature of the number represented. On the 
other hand, in the case of a continued fraction, the denominators of 
the convergents are completely determined by the number repre- 
of the convergents are completely determined by the number repre- 
sented. We, therefore, have every reason to expect that these conver- 
gents (since they are connected in a close and natural way with the 
number represented, and are completely determined by it) will play a 
significant role in the solution of the problem of the best approxima- 
tion of a number by a rational fraction.’ 

Let us agree to call a rational fraction a/b (for b > 0) a best approxi- 
mation of a real number a if every other rational fraction with the same 
or smaller denominator differs from a by a greater amount, in other 
words, if the inequalities 0 < d < b, and a/b ¥ c/d imply that 


C a 
ja—5|>|e—$|. 


2 Two interesting algorithms for representing irrational numbers were advanced 
by M. V. Ostrogradskil shortly before his death. His brief notes on the matter were 
discussed on bits of paper in the manuscript depository of the Academy of Sciences 
of the Ukrainian SSR. These notes were deciphered in an article by E. Ya. Remez, 
“O znakoperemennykh ryadakh, kotorye mogut byt’svyazany s dvumya algorif- 
mami M. V. Ostrogradskogo dlya priblizheniya irratsional’nykh chisel” (“Alternat- 
ing series that may be connected with two algorithms of M. V. Ostrogradskii for 
approximating irrational numbers”), Uspekhi matematicheskikh nauk, 6, No. 5 (45), 
33-42 (1951). As Remez discovered, Ostrogradskii’s algorithms give better approsi- 
mations than continued fractions in certain cases. Unfortunately, no detailed study 
of these algorithms, even for computational purposes, has as yet been made. (B. G.) 
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THEOREM 15. Every best approximation of a number a is a convergent 
or an intermediate fraction of the continued fraction representing that 
number. 

PRELIMINARY REMARK. For this proposition to have no exceptions 
it is necessary, as we agreed in section 2, to introduce into our consider- 
ations convergents of order — 1, by setting p-; = 1 and q-ı = 0. For 
example, the fraction 4 is, as we can easily verify, a best approximation 
of the number 1; however, it is not one of the convergents or intermed- 
iate fractions of that number, since the set of these fractions (if we 
begin with the convergents of order zero) consists of only two numbers, 
namely, 2 and 4. However, if we take the fraction 3 as a convergent 
of order —1, this set will consist of 


thus including the fraction 4. 

Proor. Suppose that a/b is a best approximation of the number a. 
Then, first of all, a/b > ao, because if a/b < ao, the fraction ao/1 (be- 
ing distinct from a/b and having a denominator that is no greater 
than b) would lie closer to a than does a/b. Therefore, a/b would not 
be a best approximation. 

In a similar manner, we can show that 


$ <+ L 


Thus, we know that ao < (a/b) < ao+ 1. If a/b = avor a/b = ao + 1, 
the conclusion of the theorem would be evident since a&o/1 = po/qo is 
a convergent and (ao + 1)/1 = (fot #-1)/(go + g_1) is an interme- 
diate fraction of a. 

If the fraction a/b does not coincide with any convergent or inter- 
mediate fraction of the number a, it must lie between two consecutive 
such fractions. For instance, for properly chosen k and r (with k > 0, 
O0 <r < aiy,0rk = 0,1 <7 < aj), it will lie between the fractions 


and 
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so that 


a Pl + Pp | 


a ik lIR- P +i) EP, PT Pp_y 
b atha 
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Cea ioe? Bok ee Ce a 


But, on the other hand, it is obvious that 


a Pa Pp, 


b Iar Faga 








a: m 
bar tI) 


where m is a positive integer and hence is at least equal to unity. Con- 
sequently, 








1 1 
b (ar Faga) i (a EDHI) Ir Fa 
and hence, 


Irr +1) Fagg- <8. 


Ph DEP, 
q,(°+1)+9,_,° 


with denominator less than 8, is closer to the number a than is the frac- 
tion 


The fraction 


(25) 


Pe t Ppt (26) 
a tagi 


(because, in general, from the result of sec. +, every intermediate frac- 
tion is closer to a than is the preceding one) and hence, is also closer 
than is the fraction a/b, which lies between expressions (25) and (26). 
However, this contradicts the definition of a best approximation, thus 
proving Theorem 15. 

In the definition of the concept of best approximation, which is at 
the basis of this theorem, we evaluated the closeness of the rational 
fraction a/b to the number a in terms of the smallness (in absolute 
value) of the difference a — (a/b) (which, of course, is the most natu- 
ral procedure). However, it is often more important or convenient in 
number theory to examine the difference ba — a, which differs from 
the preceding one only by the factor b. Thus, the smallness of this dif- 
ference (in absolute value) can also serve as a measure of the closeness 
of the fraction a/b to the number a. This change from one characteris- 
tic to another may at first glance seem trivial, and frequently it is. 
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Howevcr, this is not always the case, as we shall soon see. The signifi- 
cant point is that the factor 6 is not a constant, but is dependent on 
the approximating fraction itself, and changes when this fraction is 
changed. 

Let us now agree to refer to those best approximations mentioned 
in Theorem 15 as best approximations of the first kind. Let us further 
agree to call the rational fraction a/b (where b > 0) a best approxima- 
lion of the second kind of a number a if the inequalities c/d Æ a/b and 
0<d< bimply 


|da — c| >|ba—al. 


Best approximations of the second kind are thus defined in terms of 
the characteristic |ba — a| in a manner completely analogous to the 
definition of best approximations of the first kind in terms of the 
characteristic |a — a/b]. 

It is easy to show that every best approximation of the second kind 
must necessarily be a best approximation of the first kind. For if 


t-il] (H$ e<) 


on multiplying the first of these inequalities by the third, we would 
obtain 





|da — c| <|ba— a|; 


in other words, if the fraction a/b was not a best approximation of the 
first kind, it could not be a best approximation of the second kind. 

The converse is not true: a best approximation of the first kind can 
fail to be a best approximation of the second kind. For example, the 
fraction 4 can easily be shown to be a best approximation of the first 
kind of the number 4. However, that it is not a best approximation of 
the second kind is seen from the inequality 


|1-5—0]< 3-5! | (1 < 3), 


It follows from these remarks and from Theorem 15 that all best 
approximations of the second kind are convergents or intermediate 
fractions. However—and here lies the fundamental significance of the 
apparatus of continued fractions in finding best approximations of the 
second kind—we can make a much stronger assertion. 

THEOREM 16. Every best approximation of the second kind is a con- 
verge. 








REPRESENTATION OF NUMBERS 25 


Proor. Suppose that a fraction a/b is a best approximation of the 
second kind of the number 


a = [lg a), dy, ...], 
whose convergents will be denoted by :/q;. If a/b < ao, we would 
obtain 


|1 -a— a] <|a—$| <|b2—a (<b), 


that is, a/b would not be a best approximation of the second kind. 
Thus, a/b > ao. But then the fraction a/b, if it did not coincide with 
one of the convergents, would either lie between two convergents 
pr-1/qk—1 and Pr+1/qr+1, Or would be greater than p1/qı. In the first 
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b> fpg (27) 
on the other hand, 
ee E : 
b Igy KITS 
and hence, 
|ba—a | = ’ 

Tet 
whereas 

l 

| qka — Pr | < q , 

k+l 

so that 
lga — Pr) <S [ba — a]l. (28) 


Inequalities (27) and (28) show that a/b is not a best approximation 
of the second kind. 
In the second case (that is, if a/b > p:/qi), we have 


a Ba aly te 
ja—F|>|2 D |> bq’ 





so that 
l 
| ba — a| ar a Ti 


26 CONTINUED FRACTIONS 
On the other hand, it is obvious that 


l 
[l-a—al<z. 


so that 

|ba—a|>|1-a—a)| 0 <b), 
which again contradicts the definition of a best approximation of the 
second kind. This proves Theorem 16. 

Let us now consider the converse of Theorems 15 and 16. That the 
converse of Theorem 15 is false can be seen by considering 4, which, 
as is easily shown, is an intermediate fraction for the number a = $, 
while it is not a best approximation, since 


$i <|- (1 <2). 


There are many more such examples, as the reader can verify for him- 
self. 

On the other hand, Theorem 16 does have an almost complete con- 
verse, which, of course, greatly enhances its value. 

THEOREM 17. Every convergent is a best approximation of the second 
kind, the sole exception being the trivial case of 

l Po Qo 
R Maas 

PRELIMINARY REMARK. In the case of a = ao + 4, the fraction 

po ‘Go = 4/1 is not a best approximation of the second kind because 


|1 -2—(@)+1)]=1]1-a—ag. 


Proor. Let us examine the expression 


where y takes the values 1, 2, +++ , gx, and x can take arbitrary in- 
tegral valucs. We denote by yo that value of y for which expression 
(29), after suitable choice of x, takes the smallest possible value. (If 
there are several such values of y, we take the smallest of these for yo.) 
We denote by xo that value of x at which |yoa — x| attains its mini- 
mum. It is casy to see that this value is unique. For if 














(Xp Æ xo) 


REPRESENTATION OF NUMBERS 27 


we would have 


Xo + Xo 
2Yo 


This fraction is irreducible. For if xo + x’) = Jp and 2y9 = lg (with 
l > 1), we would have, for / > 2, 


paip 2=2, jesen 
which contradicts the definition of yo; and for / = 2, we would have 


q = yo and 
[ga —p|=] ya —p|=0 < | y2 — xol, 


which contradicts the definition of xo. 
Expanding the rational number a as a continued fraction, we thus 
obtain 


/ 
Bo: Pare soap 


dn = 2Y = rE Faro a, = 2, 
so that if a, > 2 or if a, = 2 and n > 1, we have gp_1 < yo. But 


l l l 
lga- — Pn- l 5 =a S y Sla — Xol. 


which contradicts the definition of yo. If n = 1 and a, = 2, we have 
a = do + 4 and yo = 1, which is the one exceptional case. 

Thus, the values yo and xo are uniquely defined by the given condi- 
tions. It directly follows from this that xo/yo is a best approximation 
of the second kind for the number a, since the inequalities 

Xo 
[ba — a| < | y2 — xol, Bey,” b< Nor 
would obviously contradict the definitions of xo and yo. From Theo- 
rem 16, we therefore have 


Xo 5Ps Yo is (s < k). 


If s = k, the theorem is proved. But if s < k, we obtain 


l l l 
—x SS eS IE | 1. <—, E 
| 75% Ps | > I+ 4% 54) = Gp + 4p PA Pe | PE 
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and from the definitions of the numbers fs = vo and gs = yo, we would 
have 


lga — Ps| S | 9ga — Del. 


so that 
1 l 
E TEEN 
Ig- Ik ~ Ikr 


that is, 


Iry < Ir F ar-r 


which is impossible because of the rule by which the numbers gq; are 
formed. This completes the proof of Theorem 17. 

Those properties of the apparatus of continued fractions that we 
have established in the present section were, historically, the original 
reason for the discovery and study of that apparatus. When Huygens 
set about constructing a model of the solar system by using toothed 
wheels, he was confronted with the problem of determining what num- 
bers of teeth for the wheels would give a ratio for two interconnected 
wheels (equal to the ratio of their periods of rotation) that would be 
as Close as possible to the ratio a of the periods of revolution of the cor- 
responding planets. At the same time, the number of teeth obviously 
could not, for technical reasons, be too high. Thus, Huygens’s problem 
was Lo find a rational number with numerator and denominator not 
exceeding a certain bound that would still be as close as possible to 
the given number a. (The number a might theoretically be irrational, 
but, in practice, it is assumed, in a given case, to be a rational fraction 
with very large numerator and denominator.) We have already seen 
that the theory of continued fractions provides the means of solving 
this problem. 


7. The order of approximation 


In the preceding section, we were concerned with evaluating the small- 
ness of the difference |a — (px/g.)| in comparison with other differ- 
ences of the same type. Here, we shall make an absolute evaluation of 
this difference. Obviously, the only way of evaluating the smallness 
of |a — (p/q) | consists in comparing it with some decreasing func- 
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tion of gx. To this end, Theorem 9 of Chapter I leads us directly to the 
inequality? 


p I 








Therefore, the question must arise as to whether we can strengthen 
this inequality, that is, replace its right side with another function 
f(qx) of the denominator g, that, for all n > 1, would satisfy the in- 
equality 


f<. 


It is easy to see that if we want this strengthened form of inequality 
(30) to be satisfied for arbitrary a at all values of k, no significant 
strengthening in this direction is possible. More precisely, for any 
e > 0, we can always find a case for which 











p l—:e 
E 5 
Vp dk 
To show this, we need only examine the number 
— 0: _ _Aa+tl 
a = [0; n, I, "= TaD’ 


for which, 


P=), q=n, py=nt+l, qa=n(n +2), 


and therefore, 
aa 


Bs Pi). ae ee, 
qi 


q3 qı 





If we now choose # such that 





we have 





3 If a = px/qz (when Theor. 9 is inapplicable because there is no gı), inequality 
(30) becomes trivial. 
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However, if we relax the requirement that the strengthened inequal- 
itv be satisfied for arbitrary a at all values of k (without exception), 
we can then obtain a number of interesting and important propositions, 
as we proceed to demonstrate. 

THEOREM 18. If a number a has a convergent of order k > O, ai least 
one of the following two inequalities must hold: 











1 ] 
fee os p= ga ° 
Gp 2q% Jk- 2gp) 





Proor. Since a lies between px_1/qe_1 and px/ge, we have 
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(The inequality expresses the fact that the geometric mean of the 
quantities 1/g? and 1/g?_, is less than their arithmetic mean; equality 
would be possible only if gz = gx—-1, which in the present case is ruled 
out.) The assertion of the theorem follows immediately. 

This proposition is interesting because it has a converse (in a cer- 
tain sense). 

THEOREM 19, Every irreducible rational fraction a/b that satisfies the 
tnegualtly 


a l 
[= 5| <7 
is a convergent of the number a. 
ProoF. On the basis of Theorem 16, it is sufficient to show that the 


fraction a/b is a best approximation of the second kind of the number a. 
Suppose that 


l c a\. 
|d1—ce| <|ba—al < sy (a >0, +T) 
then, 


c l 
a— -F| < z 





and, consequently, 


giic |+ 








a 1 1 
a— $| < aa tHE mg: CD 





On the other hand, since c/d ¥ a/b, we have 


c a 1 
-$ > td" 
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Therefore, inequality (31) implies 


1 > b4+d 
bd < d ’ 

so that d > b. Thus, the fraction a/b is a best approximation of the 
second kind of the number a and Theorem 19 is proved. 

A further strengthening of Theorem 18 is the following, consider- 
ably more profound theorem. 

THEOREM 20.4 Jf a number a has a convergent of order k > 1, at least 
one of the following three inequalities must hold: 








Pr l Pp} | ] 
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Pro l 
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Proof. Let us define, for k > 1, 
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Vp) 
LEMMA. [fk > 2, Yr < V5, and Yuu < V5, then 
V 5-1 
Pe > —z 
PRooF. Since 
l dn 39 
= ——_ = A ( ) 
Ọn+1 dn-ı n + Pn 
and i 
a ae OF) 
it follows that 
1 1 
= T p = ş 
Prati n+! Pa . Yn 


and, from the conditions of the lemma, 
= l 1 = 
Cetre<V5, re Ma da 


4 Some simplification of the proof given here appears in the article by I. I. Zhogin, 
“Variant dokazatel’stva odnoi teoremy iz teorii tsepnykh drobei” ( A variation of 
the proof of a theorem in the theory of continued fractions”), Uspekhi matematiche- 
skikh nauk, 12, No. 3, 321-322 (1957). 
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so that 
(V 5— ee) (V5— =) ali 
Yk 
or. since œ; is a rational number, 


5— V5 (et) > 0 


Then, since ¢ > 0, we obtain 


and, consequently, 


Ve V5—1 


<a: Cp > 9 , 





which proves the lemma. 
Let us now suppose that, in contradiction to our assertion, 


mee 


(n=k, R—1, R—2). 








> Fgh F 2 
From formula a of Chapter I, we have 
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and, consequently, 
Ony S V5 (n=k, kR—1, k—9). 
We conclude, on the basis of our lemma, that 


o, > V31 Ta 
er 





, Prt. > ’ 


and hence, because of (32), 





a =——%< a 1. 


1 
Pett V5—1 2 
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which is impossible. This contradiction completes the proof of Theo- 
rem 20. 

Theorems 18 and 20 give the obvious impression of the beginning 
of a series of propositions that will admit yet further extension. How- 
ever, this impression is erroneous. Consider the number 


a = [1; 1, 1,...]. 


Assuming, as usual, that a = 1 + (1/71), we obviously have r, = a, 
so that 


a = l+ —, g? — qa — 1 = 0, 


and, consequently, 











a= EVS 
2 . 

Since, obviously, 7, = a for arbitrary 7, we have 

__ PtP i 

1T F a f 
and, consequently, 
P, 1 l 
ETA 
k 


But from Theorem 6 of Chapter I, we have 





1r = [l; 1, 1,..., I)a as k>% 
Tp 
so that 
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This shows that. no matter what the number ¢ < (1/./5) may be, for 
sufticiently large $, we will obtain 








Thus, the constant 1/ v5 in Theorem 20 cannot be replaced by any 
smaller constant if we wish the corresponding inequality to be satisfied 
for an infinite set of values of k with arbitrary a. For every smaller con- 
stant, there exists an a (namely, a = $(./5 + 1)] that can satisfy the 
required inequality for no more than a finite number of values of &. 
Thus, the chain of propositions that begins with Theorems 18 and 20 
is broken after the latter theorem, and admits no further continuation. 


8. General approximation theorems 


Up to now, we have been primarily interested in approximations 
given by convergents and have clarified a number of fundamental 
questions associated with this problem. Since we have seen that the 
convergents are best approximations, we may assume that the ob- 
tained results will allow us to study, in full measure, the rules that 
govern the approximation of irrational numbers by rational fractions, 
independently of any particular representing apparatus. We now turn 
to problems of this type. It is, of course, impossible (within the frame- 
work of the present elementary monograph) to give any sort of com- 
plete exposition of the fundamentals of the corresponding theory, part- 
ly because of lack of space, but primarily because such an exposition 
would have only an indirect bearing on our problem. We shall confine 
ourselves to presenting a number of elementary propositions, which will 
illustrate the application of continued fractions to the study of the 
arithmetic nature of irrational numbers. 

The first problem that naturally arises in connection with the results 
of the preceding section may be formulated as follows: For what con- 
stants c does the inequality 


-4< e 


have an infinite set of solutions in integers p and q, q > 0, for arbi- 
trary rcal a? The final result of the preceding section leads us to the 
following theorem. 

THEOREM 21. Inequality (33) has an infinite set of solutions in in- 
legers p and q (q > 0) for arbitrary real a if c > (1/+/5). However, if 
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c < (I/V5), inequality (33) will, for suitably chosen a, have only a 
finite number of such solutions. 

The first assertion is an immediate consequence of Theorem 20. (In 
the case in which a is a rational number a/b and, therefore, has only a 
finite number of convergents, the first assertion of Theorem 21 can be 
proved in a trivial manner by setting q = ub and p = na, for n = 1, 
2, 3, +++). Suppose, then, that c < (1/V 5). As in section 7, let us set 


al t¥S 1, bs te cack: 


If two integers p and g (q > 0) satisfy inequality (33), Theorem 19 
tells us that p/g is a convergent of the number a. But we saw at the 
end of section 7 that only a finite number of these convergents satisfy 
inequality (33) under our hypothesis that c < (1/4/5). This proves 
our assertion. 

Thus, in general (that is, if we consider all possible real numbers a), 
the order of approximation characterized by the quantity 1/(/5g°) 
cannot be improved. (The term “order of approximation” refers to that 
magnitude of error within which a suitable estimate can always be 
found.)® This does not mean that there are no individual irrational 
numbers for which approximations of much higher order are possible. 
On the contrary, the possibilities in this direction are boundless —a fact 
that is most easily shown by the apparatus of continued fractions. 

THEOREM 22. For any positive function (q) with natural argument q, 
there is an irrational number a such that the inequality 


Ja— 2| <eo 


has an infinite number of solutions in integers p and q (q > 0). 

PROOF. Let us construct an infinite continued fraction a by choos- 
ing its elements successively in such a way that they will satisfy the 
inequalities 

1 
Opay > a eee (k = 0). 
IR? (Ip) 

This, of course, can be done in an infinite number of ways. Here, do can 

be chosen arbitrarily. Then, for any k = 0, 
1 1 1 
4 — Pr == < 3 < $ (dp). 
Tp Iriri Tr lCrtiIr T ar-) Paste 


which proves the theorem. 

















5 Translation editor’s note. 
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We now note that, in the most general case, the inequalities 
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from which it is clear that, for given go, @1, °** , ak, the greater the 
subsequent element @x+1 is, the more closely the fraction p,/g, will 
approximate the number a. And since the convergents are, in all cases, 
best approximations, we arrive at the conclusion that those irrational 
numbers whose elements include large numbers admit good approxima- 
tion by rational fractions. This qualitative remark is expressed quan- 
titatively in inequality (34). In particular, irrational numbers with 
bounded elements admit the worst approximations. Thus, it becomes 
clear why we have repeatedly chosen the number 


Anam I l,a] 


when we wished to exhibit an irrational number that did not admit 
approximations of higher than a fixed order. Of all irrational numbers, 
this clearly has the smallest possible elements (excluding ao, which 
plays no role here) and hence is the most poorly approximated by ra- 
tional fractions. 

Those approximating properties that are peculiar to numbers with 
bounded elements are completely expressed in the following proposi- 
tion, which, after what has already been said, is almost obvious. 

THEOREM 23. For every irralional number a wilh bounded elements, 
and for sufficiently small c, the inequality 


a c 
q — — — 
| q < q? 


has no solution in integers p and q (q > 0). On the other hand, for every 
number a with an unbounded sequence of elements and arbitrary c > 0, 
inequality (33) has an infinite set of such solutions. 
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In other words, irrational numbers with bounded elements admit 
an order of approximation no higher than 1/g?, while every irrational 
number with unbounded clements admits a higher order of approxima- 
tion. 

ProoF. If the set of elements of the continued fraction representing 
a is not bounded above, then for arbitrary positive ¢ there is an infinite 
set of integers & such that 


1 
Gnu >>? 


and, consequently, on the basis of the second of the inequalities in (34), 
there is an infinite set of integers k such that 








Pp c 
oa * <5 
Te dg 


which proves the second assertion of the theorem. If there exists an 
M > 0 such that 


a, < M (k=1, 2,...), 


then, on the basis of the first of the inequalities in (34), we have, for 
arbitrary k = 0, 


l 
ER 


qg, | %(M+2) 


Now let p and q be arbitrary integers (q > 0), and let k be deter- 
mined by the inequalities 
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Then, since all convergents are best approximations of the first kind, 
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Thus, if we choose 


} 
E< FD M+ 1 


inequality (33) cannot be satisfied for any pair of integers p and q 
(y > 0). This proves the first assertion of the theorem. 

Up to this point, we have always evaluated the closeness of an ap- 
proximation in terms of the smallness of the difference a — (p/q); 
however, we might have considered instead the difference ga — p (as 
in sec. 6), making the appropriate changes in the formulation of all 
the theorems. This simple observation leads directly to a certain new 
and extremely important aspect of the problem that we are studying. 

The simplest homogeneous linear equation with two unknowns x 
and y, namely, 


ax — y=0, (35) 


where a is a given irrational number, obviously cannot be exactly 
solved in whole numbers (except, of course, in the trivial case of 
x = y = 0). However, we may pose the problem of obtaining an ap- 
proximate solution, that is, of choosing integers x and y for which the 
difference ax — y is sufficiently small (that is, less than a preassigned 
amount). Obviously, all the preceding theorems of this section can 
be interpreted as confirmation of the rules governing this kind of ap- 
proximate solution to equation (35) in whole numbers. Thus, for ex- 
ample, Theorem 21 shows that there always exists an infinite set of 
pairs of integers x and y (x > 0), such that 


C 


for any positive C greater than or equal to 1/ v5. 
With this approach, it is natural to pass from the homogeneous equa- 
tion (35) to the non-homogeneous equation 


ax — y =ß (37) 


(where 6 is a given real number) and to investigate the existence and 
nature of its approximate solutions in integers x and y (in other words, 
to investigate the principles involved in attempting to make the dif- 
ference ax — y — $B as small as possible by a suitable choice of integers 
x and y). This problem was first posed by the great Russian mathe- 
matician P. L. Chebyshev, who obtained the first basic results con- 
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nected with it, and has been the subject of continued intensive study, 
especially by the Soviet arithmetic school. 

The first basic feature distinguishing the non-homogeneous case 
from the homogeneous one is that it is possible to make the quantity 
|ax — y — | arbitrarily small for arbitrary 8 by a suitable choice 
of x and y only if the number ais irrational (whereas, in the homogene- 
ous case, the quantity |ax — y| can be made arbitrarily smali for 
any a). In fact, if a = a/b, where b > 0 and a are integers, then, by 
setting 8 = 1/26, we obtain, for arbitrary integers x and y, 


vai | 2(a*— by) — 1 l 
premye e 


since |2(ax — by) — 1|, being an odd integer, is at least equal to 
unity. 

Thus, in all that follows, we shall assume a to be irrational. With 
this understanding, we shall now show that not only is it also possible 
to make the quantity |ax — y — ß| arbitrarily small, but the analogy 
with the homogeneous case can be extended considerably further. 

THEOREM 24 (Chebyshev). For an arbilrary irrational number a and 
an arbilrary real number B, the inequality |ax — y — B| <3/x has an 
infinite set of solutions in integers x and y (where x > 0).° 

PRELIMINARY REMARK. Obviously, this result is completely analo- 
gous to the corresponding problem for homogeneous equations, cx- 
pressed in Theorem 21. The difference consists only in the fact that 
here, instead of 1/ V5, we have 3. The order of the approximation is 
the same as before. We note also that the number 3 is not the best pos- 
sible and that the exact value of the greatest lower bound of the set of 
numbers’that would verify Theorem 24 is considerably less than 3. 

ProoF. Let p/q be an arbitrary convergent of a. We then have 


ò 
tas Ot (0<|8] <1); (38) 
also, for any real 8, we can find a number # such that 
1 


6 A simple proof of a somewhat stronger theorem is found in Khinchin’s article, 
“Printsip Dirikhle v teorii diofantovykh priblizhenij” (“Dirichlet’s principle in the 
theory of Diophantine approximations”), Uspekhi matematicheskikh nauk, 3, No. 3, 
17-18 (1948). Further refinements are contained in Khinchin’s article, “O zadache 
Chebysheva” (“On a problem of Chebyshev”), Jzvestiva akad. nauk SSSR. ser. 
matem., 10, 281-294 (1946). (B. G.) 
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so that 
aa (|e | <1) (39) 
q 2q en 


Since p and g have no common divisors other than + 1, there exists 
a pair of integers x and y such that 


o<x< <a px —qy =t. 


For if r sis the convergent immediately preceding $/g, 
qr — ps =£ = +], q (ert) — p (est) = te? = t, 
and for an arbitrary integer k, 
p (kq — est) — q (kp — crt) = f; 


but & can be chosen so that 





q eee 3q 
yt =kg est < a 
Then, on the basis of equations (38) and (39), 
ii Sana he Pe a ie 
| y— p| M ay: 
xd ò’ x 1 
“ë gl ety 
and since 
2 
(>F, 
we have 
. 9 3 3 
== 8 | aoa = 


Finally, since g can be chosen arbitrary large and since x > g/2, it 
follows that x can be arbitrarily large. This proves the theorem. 

But the problem of an approximate solution to equation (37) in 
whole numbers can be put in a different, and somewhat more natural, 
form. Since the crux of the problem is to make the quantity |ax — 
y — B| as small as possible with as large integral values of x and y as 
possible, it is most natural to state the problem in the following man- 
ner. We know (from Theor. 24) that, for any positive number 7 (no 
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matter how large), any irrational number a, and any real 8, we can 
find integers x > 0 and y satisfying the inequality 


jax—y—pl<t. (40) 


However, Theorem 24 does not generally give us any information as 
to the limits within which we should seek these numbers so as to attain 
the required accuracy, characterized by the quartity 1/2. This might 
be achieved, for example, if we could exhibit some number V, depend- 
ent on z, but independent of a and 8, such that inequality (40) would 
always be satisfied under the additional condition that 


x| <N. 


This new statement of the problem is obviously quite different from 
the original one. Whereas formerly (as in Theor. 2+) the accuracy of 
approximation was determined by the value of x, we now wish to fix 
this accuracy in advance and sce how large a value of x we should 
choose to attain this accuracy. The solution to the problem is signili- 
cantly altered by this difference in its statement. Specifically, we obtain 
quite different results in the homogeneous and non-homogencous cases. 

In the case of a homogeneous equation (8 = 0), the stated problem 
has a very simple solution. 

THEOREM 25. For all real numbers n > 1 and a, there are integers 
x and y satisfying the tnequaltties 


O<Sx Caz, jax—yl< a. 
PROOF. If a is a rational number, a/b, such that 0 < b < n, the 
conclusion is immediate for x = b and y = a. If a either is irrational 
or has a denominator exceeding n, we define k by the relationship 


Jr SP < Vest 
(where ;/gs denotes the &th-order convergent of a) and obtain 
Pr l l 
a — = | < ——— < —, 
Tp Teire Tr” 








so that 
1 
[age — Prl < F’ O< 4,0, 


which proves the theorem. 
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Now. we naturally ask whether we may obtain the same order of 
approximation in the case of the non-homogeneous equation (37). In 
other words, may we assert that, for any irrational number a, a positive 
number C can be found such that, for any n > 1 and £, there exist 
integers x and y satisfying the inequalities 


O<x <Cn, Jax —y— I<}? 


(Clearly, we are now asking even less than for the homogeneous case, 
since we are allowing C to depend on a, whereas in the homogeneous 
case, C = 1 was an absolute constant.) It is easy to give certain a priori 
arguments against the possibility of such an assumption. First, for ra- 
tional a, it is clearly untrue, since, as we have seen, the quantity 
‘ax — y— 8, cannot in general (that is, with arbitrary 8) be made 
arbitrarily small. This leads us to expect that if a is irrational (but is 
approximated extremely closely by rational fractions), the quantity 
ax — y — @., even though it can (on the basis of Theor. 24) be made 
arbitrarily avail requires comparatively large values of x and y to 
accomplish this with a suitably chosen value of 8. These considerations 
also lead us to suppose that the more poorly the number a is approxi- 
mated by rational fractions (that is, the more difficult it is to make the 
quantity av — y approach zero), the easier it will be to have the dif- 
ference ax — y approach an arbitrary real number 8. As we know, this, 
in turn, requires that the elements of the number a not increase too 
fast. All these preliminary considerations are expressed precisely in the 
following theorem. 
THEOREM 26. For the existence of a positive number C with the prop- 
erty that, for arbitrary real numbers n > 1 and B, two integers x and y 
(x > 0) exist satisfying the inequalities 


1 
x<Cn, lax —y— I< 7. 


it is necessary and sufficient that the irrational number a be represented 
by a continued fraction wilh bounded elements. 

PROOF. Suppose that a = [d@9; a1, de, ++°*], that a; < M (for i = 
1, 2, +1), that m > 1, and that 8 is an arbitrary real number. De- 
noting by px/g, the convergents of the number a, we can determine 
the subscript & from the inequalities 


q Sm < aker 
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then, 
a a ae ae 
qT, Tepe) -ma 
or 
De g 8 . 
Se a (èl < 1). (41) 
We now choose an integer ¢ such that 
124% —tl<z, 
so that 
p=— +5 (le7]<1 
7, ' %, b <1). (42) 


Finally, as in the proof of Theorem 24, we find a pair of integers x and y 
satisfying the relationships 


XPy—YVIp=b VOX KG. (43) 
It follows from (41), (42), and (43) that 
XP, t xÙ a3 
ax — y — 8 | = | — — y — — 4+ — — __. 
| y— $| | ie ae? eo 
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Up to now, the number m > 1 has been completely arbitrary. If 
we now set m = (M + 3)n for given u > 1, we shall obviously have 
m > 1. Consequently, from what was stated above, if we choose the 
numbers x and y as indicated, we have 


O< xg < m= ats on, 





l 
= ay A 
lax — y mae 


which proves the first part of the theorem. 
To prove the second part, let us suppose that the sct of elements ng 
of the number a is unbounded above. Theorem 23 than indicates that, 
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in this case, for any positive number e there are integers g > 0 and p 
satisfying the inequality 


Pje Ë 
E q |< g’ 
so that 
Pp de? a 1 
taa T g? ([è] < )- 


We now set z = g/eand 8 = 1/2g. Then, for arbitrary integers x and y 
(with O < x < Cn), we obtain 


xp I Xbe2 | 2(xp—yq)—l1 , xbe? 

xX — — = | = — — — = | m —— 
|jax—y—B| q > 5g T g? Jg F q? 
~ 2(xp—y—1l xè. l Ce 1—2?Ce 1—2Ce 1 
g 2g g@~ ua q 2q o w n’ 





But no matter how large C may be, for sufficiently small e, we shall 
have [(1 — 2Ce)/2e] > 1, and, consequently, for arbitrary integers x 
and y (with O0 < x < Cu), we obtain 


jax—y—B[>-, 


which proves the first part of the theorem. 

Let us review the results that we have obtained. In investigating 
the approximate solutions Lo equation (37) in whole numbers, we must 
examine as a ‘“‘normal’’ case the one in which the accuracy character- 
ized by the quantity 1/2 can be attained for arbitrary n > 1 at some 
x < Cu, where C is a constant (possibly depending on a). A homogene- 
ous equation (obtained for 8 = 0) always has a normal solution (Theor. 
25). Theorem 26 shows that the general (non-homogeneous) equation 
has a normal soiution if, and only if, the corresponding homogeneous 
equation has no “‘supernormal”’ solution (that is, if it is impossible to 
satisfy the homogeneous equation with integers x > 0 and y such that 
x < en for arbitrary «e > 0 and properly chosen z, with an accuracy 
of 1/z). From this point of view, the results of our investigation can 
be regarded as a variation of the general law concerning the solution 
of linear equations (algebraic, integral, etc.): Zz the general case, a non- 
homogeneous equation can be solved “normally” if the corresponding homo- 
geneous equation has no ‘‘supernormal” solution. 

We note also that in Theorem 26 we required that C be independent 
of B. The same result would hold if we allowed C to be a function of £, 
hut the proof (second part) would be somewhat more complicated. 
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9. The approximation of algebraic irrational numbers 
and Liouuville’s transcendental numbers 


Suppose that 
f (x)= aax ... tax" (44) 


is a polynomial of degree n with integral coefficients do, di, +++ , dy. 
Then, a root, a, of this polynomial is said to be algebraic. Since ev cry 
rational namaker a = a/b can be defined as the root of the first- degree 
equation bx — a = 0, the concept of an algebraic number is clearly a 
natural generalization of the concept of a rational number. If a given 
algebraic number satisfies an equation f(x) = 0 of degree n, and (oe 
not satisfy any equation of lower degree (with integral coeflicients), 
it is called an algebraic number of degree w. In particular, rational 
numbers can be defined as first-degree algebraic numbers. The num- 
ber \/2, being a root of the polynomial x? — 2, is a second-degree alge- 
braic number, or, as we say, a quadratic irrational. Cubic, fourth-de- 
gree, and higher irrationals are defined analogously. All non-algcbraic 
numbers are said to be transcendental. Examples of transcendental 
numbers are e and z. Because of the great role that algebraic numbers 
play in contemporary number theory, many special studies have been 
devoted to the question of their properties with regard to their approxi- 
mation by rational fractions. The first noteworthy result in this direc- 
tion was the following theorem, known as Liouville’s theorem. 
THEOREM 27. For every real irrational algebraic number a of degree n. 
there exists a positive number C such that, for arbitrary integers p and q 


(q > 0), 
-4> 


PRrooF. Suppose that a is a root of the polynomial (44). From alge- 
bra, we may write 


f(x) = (x — 2) fi (%), (49) 


where fi(x) is a polynomial of degree n — 1. Here, fila) 4 0. To show 
this, suppose that fı(a) = 0. Then, the polynomial fi(x) could be 
divided (without a remainder) by x — a and, hence, the polynomial 
f(x) could be divided by (x — a)*. But, then, the derivative f'(x) could 
be divided by x — a; that is, we would have f’(a) = 0, which is im- 
possible since f'(x) is a polynomial of degree n — 1 with integral co- 
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efficients and a is an algebraic number of degree z. Hence, fila) Æ 0, 
and, consequently, we can find a positive number ô such that 
f(x) #0 (a—8<¢x<a+t8). 
Suppose that p and q (q > 0), are an arbitrary pair of integers. If 


a — (p,g)| < ô, then f1(p/g) ¥ 0, and, by substituting x = p/q in 
identity (45), we obtain 





The numerator of this fraction is an integer. It is also non-zero, be- 
cause otherwise we would have a = p/q, whereas a is by hypothesis 
irrational. Consequently, this numerator is at least equal to unity in 
absolute value. We denote by M the least upper bound of the function 
fi(v) in the interval (a — 6, a + 6). From the last inequality, we thus 
obtain 





_P l 
|a q |> Mq” 
In the event that 
p | ` 
a — — ò, 
homeo a 
it follows that 
Pe ig 
E q |> q7’ 


and if we now denote by C any positive number less than 6 and 1/M, 
we obtain, in both cases (that is, for arbitrary g > 0 and $), 


p G 
4 — — Tn ? 
| q > q” 


which completes the proof of Theorem 27. 

Liouville’s theorem shows that algebraic numbers do not admit ra- 
tional-fraction approximations of greater than a certain order of ac- 
curacy (this depending basically on the degree of the algebraic number 
in question). The main historical importance of this theorem consisted 
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in the fact that it made possible the proof of the existence of transcen- 
dental numbers, and enabled one to give specific examples of such 
numbers. As we have scen, to do this, it is sufficient to exhibit an irra- 
tional number for which rational fractions give extremely close approxi- 
mations, and theorem 22 shows that the possibilities for this are un- 
limited. 

Specifically, Theorem 27 shows that if for arbitrary C > 0 and arbi- 
trary natural » there exist integers p and g(q > 0), such that 


p | C 
a —— | [l —, 
Ja—2) < 5 (46) 
then the number a is transcendental. Using the apparatus of continued 
fractions, it is very easy to exhibit as many such numbers as we desire. 
All that is necessary is to choose clements do, a1, +*+ , ap, form the 
convergent p:/gx, and take 


a k-l 
ie dk 
since then 








Pp 1 1 1 
te te a 
k ripe ee +1 qb 


As a result of the above, inequality (46) is obviously satisfied for sufi- 
ciently large values of k, no matter what C > 0 and natural 7 may be. 


10. Quadratic irrational numbers and periodic 
continued fractions 


Theorem 27 shows that, for any quadratic irrational number a, there 
exists a positive number C, depending on a, such that the inequality 


-4$ 


has no solution in integers p and g(g > 0). From this and from Thco- 
rem 23, it follows that the clements of every quadratic irrational num- 
ber are bounded. Long before Liouville, however, Lagrange had dis- 
covered a much more significant property of the continued fractions 
representing these irrationals (one that is even more characteristic of 
them). It turns out that a sequence of quadratic irrational elements is 
always a periodic sequence and, conversely, that every periodic con- 
tinucd fraction represents some quadratic irrational number. The pres- 
ent section is devoted to a proof of this assertion. 
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Let us agree to call the continued fraction 
a = |; Ay, Ay, ...] 


periodic if there exist positive integers ko and % such that, for arbi- 
trary k > Ro, 


Grin = Ok 


In analogy with the procedure for decimal fractions, we shall indicate 
such a periodic continued fraction as follows: 


O==[Ay} 21, Aor +++, Beitr Bho Ghoti, +++ Amahai (47) 


THEOREM 28. Every periodic continued fraction represents a quadratic 
irrational number and every quadratic irrational number is represented 
bv a periodic continued fraction. 

PRooF. The first assertion can be proved in a few words. Obviously, 
the remainders of the periodic continued fraction (47) satisfy the rela- 
tionship 

Prine (k > ky). 


Therefore, on the basis of formula (16) of Chapter I, we have, for 
k 2 Ro, 

— Prop F Peo Pron eth T Prth-? > Pesn-iet Prin—2 

Ir-r T Ig-2 Irth- rin tT Irth- Irth- rt Ue tn—2 


(48) 


so that 
ke-1” k t Pp- 2 Prik- eT Piris 2 
e-r Tig-2 ee ee ee ee eee 


Thus, the number r+ satisfies a quadratic equation with integral co- 
efficients and, consequently, is a quadratic irrational number. But, in 
this case, the first inequality of (48) shows that a too is a quadratic 
irrational number. 

The converse is somewhat more complicated. Suppose that the num- 
ber a satisfies the quadratic equation 


ax? ba +c =0 (49) 
with integral coefficients. If we write a in terms of its remainders of 
order z — Pn-in + Pn-2 


Gn-1"n + fn-2 
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(again using eq. [16] of Chap. I), we see that r, satisfies the equation 
Anra + Bara +C,=0, (50) 
where -[,n, Bn, and C, are integers defined by 
A= APh OP Tn + Ia 
By = 24D g—Pn—-zt8 (Pa-1Jn—2t Pa-29n- D269 n-19n—2 | (51) 
C= Pag TOP gy an2 + CARa, | 
from which, in particular, it follows that 
CA (52) 
With these formulas, it is easy to verify directly that 
Bu—4A,C, = (—4a6) (Pa-19n -0 Ia-1Pn-2)? = b?—4ac, (53) 


that is, that the discriminant of (50) is the same for all » and is equal 
to the discriminant of (49). Furthermore, since 


Pn-1 











C= < 2 3 
Gn Jn-1 
it follows that 
bz X 
Pri5 agaa HAE (aal <I; 
n=! 


therefore, the first formula of (51) gives us 








2 
la a cqa- l 
2 
n 


Bn-1 \2 Ön- 
A, = a (a91 HE) H (20n + 


On-1 





= (aa?-+-ba-te)q?_, + 2a03,_,-+a—— + bnn 
n=l 


from which, on the basis of (49), we have 


a2 


0 
|A {=| 2aab, ya B+ Ob, | < laal Halle 
n—l 











and, on the basis of (52), 
|C,|=|Ag-1 |< 2[aa|+]a]+1 2. 
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Thus, the coefficients .4, and C, in (50) are bounded in absolute 
value and hence may assume only a finite number of distinct values as 
n varies. It then follows on the basis of (53) that B, may take only a 
finite number of distinct values. Thus, as 7 increases from 1 to œ, we 
can encounter only a finite number of distinct equations in (50). But, 
in any case, 7, can take only a finite number of distinct values, and 
therefore, for properly chosen k and +, 


lk 5T kth 
This shows that the continued fraction representing a is periodic and 
thus proves the second assertion of the theorem. 

No proofs analogous to this are known for continued fractions repre- 
senting algebraic irrational numbers of higher degrees. In general, all 
that is known concerning the approximation of algebraic numbers of 
higher degrees by rational fractions amounts to some elementary corol- 
laries to Liouville’s theorem, and certain newer propositions strength- 
ening it. It is interesting to note that we do not, at the present time, 
know the continued-fraction expansion of a single algebraic number of 
degree higher than 2. We do not know, for example, whether the sets 
of elements in such expansions are bounded or unbounded. In general, 
questions connected with the continued-fraction expansion of algebraic 
numbers of higher degree than the second are extremely difficult and 
have hardly been studied. 


Chapter III 


THE MEASURE THEORY OF 
CONTINUED FRACTIONS 








11. Introduction 


In the course of the preceding chapter, we saw that real numbers can 
be quite different in their arithmetic properties. Besides the basic di- 
visions of the real numbers into rational and irrational or algebraic 
and transcendental numbers, there are several considerably finer sub- 
divisions of these numbers based on a whole series of criteria charac- 
terizing their arithmetic nature (most importantly, criteria involving 
the approximation by rational fractions that these numbers admit). 
In all these cases, we have, up to now, been content with simple proofs 
that numbers having certain arithmetic properties actually do exist. 
Thus, we know that numbers exist admitting approximation by ra- 
tional fractions of the form p/g with order of accuracy not exceeding 
1/q? (for example, all quadratic irrational numbers); but we also know 
that there exist numbers admitting approximation of much higher 
order (Theor. 22, Chap. IL). The following question naturally arises: 
which of these two opposite properties should we consider the more 
“general,” that is, which of these two types of real numbers do we ten- 
counter more often’’? 

If we wish to give a precise formulation of the question just posed, 
we must remember that each time we refer to some property or other 
of the real numbers (for example, irrationality, transcendentality, pos- 
session of a bounded sequence of elements, etc.), the set of real num- 
bers is partitioned with respect to that property into two sets: (1) the 
set of numbers possessing that property, and (2) the set of numbers noi 
possessing it. The question is then clearly reduced to a comparative 
study of these two sets, with the purpose of determining which of them 
contains more numbers. However, sets of real numbers can be com- 
pared with each other from various points of view, and in terms of 
various characteristics. We can pose the question of their cardinality, 
of their measure, or of a number of other gauges. As regards both meth- 
ods and results, the study of the measure of sets of numbers defined by 
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a given property of their elements has proven the most interesting. 
This study. which we shall call the measure arithmetic of the continuum, 
has undergone considerable development in recent years, and has led 
to a large number of simple and interesting principles. As with every 
study of the arithmetic nature of irrational numbers, the apparatus of 
continued fractions is the most natural and the best investigating in- 
strument. However, to make this apparatus an instrument for measure 
arithmetic (that is, to apply it to the study of the measure of scts whose 
members are defined by some arithmetic property), we must first sub- 
ject the apparatus itscif to a detailed analysis from all aspects. We 
must, in othcr words, learn to determine the measure of numbers 
whose expansions in continued fractions possess some previously stated 
property. Questions of this kind can be quite varied: we may inquire 
about the measure of the set of numbers for which a, = 2, or for which 
gio is less than 1,000, or which have a bounded sequence of elements, 
or which have no even elements, and so on. The methods used in solving 
problems such as these constitute the measure theory of continued frac- 
tions. It is to the fundamentals and the clementary applications of this 
theory that the present chapter is devoted. 

Since the addition of an integer to a given real number does not 
change the fundamental propertics of that real number, we shall hence- 
forth confine ourselves to an examination of the real numbers between 
zero and one; that is, we shall always assume that ao = 0. Such a re- 
striction to a finite interval is necessary in measure theory if we do not 
wish the measure of a set, in the general case, to be infinite. We are 
assuming that the reader is familiar with the basic propositions of 
measure theory.! 


12. The elements as functions of the number 
represented 
very real number a has a unique expansion as a continued fraction 
a = [ag a, a, ...]; 
cach element a, is therefore uniquely defined by the number a; that is, 
it is a single-valued function of a: 
a, = 4, (2). 


1 The material contained in any text on functions of a real variable will be more 
than sufficient for understanding this chapter. 
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To develop the measure theory of continued fractions, we must first 
study the properties of this function and obtain a general picture of its 
behavior. The present section is devoted to this problem. 

As we noted in section 11, we are henceforth assuming that ao = 0. 
For simplicity in notation, we shall always write 

a = [Q a, ...] 
instead of 
a = [ag; ay, Qo, ate a te 

Thus, 
1 


a, + 


[a,, do, re 1 


ao + re 
Let us begin by examining the first element a, as a function of a. 


Since 
1 
si a Qo + snes” 


it is obvious that a; = [1/a], that is, a, is the greatest integer not ex- 
ceeding 1/a. Thus, 


a,=1, for Is-g; 1.€., radi, 
a,=2, for 21 <3; ie, g LILT. 


a=3, for3ci<4; ie, g <3[5. ete 


In general, 


a=k, for hoa <R+: Le; 7 ae 


The function a, = a,(a) thus has a discontinuity at all those values of a 
for which 1/a is an integer, and it increases without bound as a ap- 
proaches 0. Figure 1 gives a graphical representation. We note that a, 
is constant throughout each of the intervals 
1 1 
RET SSR 
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We shall call these intervals intervals of the first rank. We note also that 


l 


fa (a) da = + oo, 


0 


since this integral is obviously equivalent to the divergent series 
OO oO 
Se- 
E RFI) ARI 
k=1 k=1 


a) (œ) 


w|= 
a|— 
|= 
M| = 


Fic. 1 


Let us now examine the function a2(a), first considering some fixed 
interval of the first rank: 


1 l 
BHI aSk 


In this interval, a1 = k everywhere and, consequently, 
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where 1 < r < œ and œ: = [re]. As r» increases from 1 to ©, a in- 


creases from 1/(k + 1) to 1’k, thus taking all values in the given in- 
terval of the first rank. It is then obvious that 

















a; =l, for lor, < 2; ie, H <a< T’ 
l ] 
a,=2, for 2 Kr <3; 1.€., l qa< 1’ 
kta kt 
; l z 
@=3, for 3 <Lr,<4; ie, — Se T’ 
kt k+7 
and, in general, 
: l l 
a =l, forol<cr,<l+l, ie, r sS*< i 
ka ery 


Thus, in this fixed first-rank interval, the graphical representation of 
the function a2(a) has the form shown in Figure 2. 


a3 (æ) 


|- 
-ð 
|- 
3 
|- 

R~ 


x 

2 
x 
+ 


56 CONTINUED FRACTIONS 


The function a2(a) is constant in each of the intervals 





i l 
Bren 
i IF] 
which we shall call intervals of the second rank. Every interval of the 
frst rank can consequently be partitioned into a countable set of in- 
tervals of the second rank, going from left to right. (We recall that in- 
tervals of the first rank form a sequence going from right to left.) The 
set of points at which a; = k is an interval of the first rank. The set of 
points at which a> = /is a countable set of intervals of the second rank 
(one in each of the intervals of the first rank). Each interval of the 
first rank is defined by a condition of the form a; = k, and each in- 
terval of the second rank by conditions of the form a; = k, a2 = I. 
Suppose that we have defined all intervals of rank 7 and that we 
are investigating the set of functions a;(a), a2(a), +** , dn(a). Each 
system of values 
a,=h,, ay = Ry, aaa ait aa = Rn (54) 


defines some interval J, of rank z. To investigate the behavior of the 
function @n+1ı(a) in the interval Ja, we note that an arbitrary num- 
ber a of this interval may be represented in the form 


a= [k ky n, Rar Tayib (55) 


where 7,41 takes all possible values from 1 to œ. Conversely, for arbi- 
trary řn+ı (where 1 < a41 < ©), the expression (55) gives us the num- 
ber a for which conditions (54) are satisfied and which consequently 
belongs to the interval Jn. Since da41 = [7n41], we see that in each in- 
terval of rank xz, the function an+ı(a) assumes all the integral values 
from 1 to Œœ. To draw a more exact picture, let us agree to denote the 
convergents of the number a, as usual, by p:/g,. Then, 


pa Pal ati + Pn-1 
Gil n+t + n-i’ 


where rn41 increases from 1 to œ as a runs through the interval Jn. 
The numbers pr, Qa, Pa—1, Yn—1 remain constant, since they are com- 
pletely determined by the numbers ay, a2, *** , dn, each of which has 
the same value for all points of the interval Jn. In particular, by setting 
fayı = l and then letting 7,4; ©, we obtain as end points of the 
interval J, the points 
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Pat Pn-1 Pn 
ee E oe and ans 
In + 4n-1 Gn’ 
and since 
g — Pa — Panti t Pn- _ Pa (— 1)” 


a a 


Gn Gn n+1 + an-ı n qn (In n+ F n-i) 


a is a monotonic function of r41 in the interval (1, ©). Conversely, 
rayı and hence, &n+ı—is a monotonic function of a in the interval 


i Qn’ Qnt+4n-i]’ 


thus, as a runs through the interval Ja, the function an+ı(a) takes, in 
succession, the values 1, 2, 3, +++ , partitioning the interval J, into 
a countable set of intervals of rank #2 + 1. This sequence is taken from 
right to left for even 2 and from left to right for odd z. 

Thus, the function a,(a), at least qualitatively, is completely de- 
fined. Let us agree to call the interval (0, 1) the (unique) interval of 
rank zero, and let us cover it with a net of finer intervals, placing in 
every already constructed interval of rank xz a sequence of intervals 
of rank 2 + 1. This sequence is taken from right to left if 7 is even 
and from left to right if 2 is odd. The function @,4,(a) (for n = 0, 1, 2, 
-++) is constant in each of these intervals of rank zn + 1. This func- 
tion is monotonic and takes all integral values from 1 to © in cach 
interval of rank 7. To each system of values 


a=k, Q= ko, ..., Qp = Ry 


there corresponds a uniquely defined interval of rank 1, and vice versa. 
The more general system of values 

Om, = Ry, Am, = Ry, ---, am Rs 
determines, generally speaking, a countable set of intervals. 

The first question posed by the measure theory of continued frac- 
tions naturally consists in determining the measure of the set of those 
points of the interval (0, 1) for which a, = & We already know that 
this set is a union of disjoint intervals. It is then a question of evaluat- 
ing the sum of these intervals. A first approximation to the solution of 
this problem is obtained quite easily. 

From here on, let us agree to denote by 


fy, Ploy oa os i 
E(, Ra.. Rs 
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the set of points of the interval (0, 1) for which the following conditions 
are satisfied: 


Qn, = k, Qn, = ko, ee ey an. = Rs; 


here, of course, all the z; and k; are natural numbers and the 7; are 
all different from cach other. We already know that such a set is al- 
ways a union of intervals. In particular, the set 


b- sep. h 
E(, Ra eee Rp 
is, as we know, an interval of rank 7, characterized by the relationships 
a= k; =n 2, oo ey n). 
Obviously, we always have 
co 
` i ecv Monty Rp Migr weer a 
k=l Rikes ki-i Rn Rivivas <4 k, (56) 


=E(" e...» Mos Niph oer r) 
ian Ry, ere) Rie Ri¢i ane? ks 


Finally, let us agree to denote by WME the measure of the set E. Let 
us consider an arbitrary interval 


qe (” 2, nen) 


of rank 7 containing the interval 


(s) all 2da, re 


Cons ki» Ra seo) Rn S 


of rank z + 1. We already know that the end points of the interval Jn 


are the points 
Pa 


Gn 


and 
Pat Pn-1 


Ia tia- 
where p:/qr denotes the convergent of order & of the continued fraction 


(Ce Seer Rah 
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On the other hand, for all points of the interval J,,:©), we have 
arti = Vail =s, 
and hence, 


SSornap SSH. 


Thus, among all the points 


TIER Prafn+1 + Pn-\ 

Gn nti F an-ı 

of the interval Ja, those for which s < ray1 < s -+ 1 belong to the 
interval J,4:. Hence, it follows, in particular, that the end points 
of the interval J,4:1 will be the points 


PaS + Pn- apg Pals D) F Pa- 








nS + 4n-1 Gn(S HI) Fan- ` 
Therefore, 
MIJ, = Pn _ Pat Pn- =F, (aa dm R ee i 
á q qn F gn-ı Gn (In + an-1) 2 Yn-1 
: HE at Gn 
my, — | Pas +Pn-1 _ Pa(S+1)+Pp-1 | 
oe PnS + 4n-1 dns +1) an-ı 
l 
~ [gas + 4gn-i) [9n (s +1) + Gn-1) 


and, consequently, 


Gn-1 
MIO, l es da 
Mi, Z dans) ( L Ta) 
(1 T Sdn ras S $ San 


Here, the second factor on the right side is obviously always less than 
2 and greater than 4. (The last assertion results from the facts that 


i i l Gn-1 
——_#“_>1 and et ae 
1 -p 422! n 


Sån 
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Therefore, we obtain 


1 OL A 
37 SR, SF (57) 


This shows that, given an arbitrary interval of rank z, the interval 
of rank u + 1 characterized by the value @,41 = s occupies a part of 
the given interval of the order of 1/s*. The fact that the bounds given 
by the inequalities in (57) are completely independent not only of the 
numbers ki, Re, «** , kn but also of the rank z (and are determined 
exclusively by the number s) is extremely important. If we rewrite 
these inequalities in the form 


Wa 20 


Oe ; 











sum over all intervals J, of rank n (or, equivalently, over ki, Re, *°* , 
k, from 1 to œ), and, finally, note that 


Ym,=1 Dwane t), 


we obtain 


35 ar < me("E)<S 


s? ` 


This provides a first approximation to the solution to the problem 
being considered. We see that the measure of the set of points at which 
some defnite element has a given value s always lies between 1/38? 
and 2/s? (and consequently, is a quantity of the order 1/s”). 


13. Measure-theoretic evaluation of the increase 
in the elements 


We now have the necessary tools for solving problems involving the 
measure of sets containing an infinite number of elements. As a first 
example of such a problem, we shall prove the following simple theo- 
rem. 

THEOREM 29. The set of all numbers in the interval (0, 1) with bounded 
elements is of measure zero. 

PRoor. We denote by Ew the set of numbers in the interval (0, 1) 
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all of whose clements are less than M. Let J, be any interval of rank 7 
whose points satisfy the conditions 


a,<M i=], 2. s. n). (58) 


The points of the interval J, that satisfy the additional condition 
n41 = k form an interval of rank 2 + 1. We denote this interval by 
Jn4i. From the first of the inequalities in (57), 


MI > ay Dy 


3k? 

so that 

logy 
N Dagan Va 
k>M k>M 
lm, | 1, du l 
>g IM, d mrgy >a ™, f= gg Wn 

i=l Mri 

and since 


[oe] 

y (k 
Pan See = Ins 
k=] 


it follows that 


mV shi — pt lms a ty. 69 
x 3 (M + Ti 
where 
— l l ' 
eC 


here, obviously, 7 < 1if M > 0. 

If we denote by E® the set of numbers of the interval (0, 1) char- 
acterized by conditions (58), we see from inequality (59) that the meas- 
ure of that portion of the set E{?+” contained in some interval J, of 
rank z is less than that of 7t/,. Since, obviously, an interval of rank 17 
that does not belong to the set E\7) (that is, one that does not satisfy 
the conditions in eq. [58]) cannot contain any point of the set EY", if 
we sum inequality (59) over all intervals of rank z in the set £7), we 
obtain 


MED < MEH. (60) 
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Successive application of this inequality gives us 
MEZD < IMEQ (nal). 

so that 


MEY —>0 as n—>%, 


since 7 < 1. But the set Ex that we have defined above is obviously 


contained in each of the sets EY. Consequently, 


ME a = 0. 
Now setting 


X Eu=E, 
M=1 


we obtain 
oO 


MEL $ MEy = 0. 
=| 


But every number with bounded elements obviously belongs to the set 
Ex for sufficiently large M and, hence, belongs to the set E£, which 
proves the theorem. 

We know (Theor. 23, Chap. II) that numbers with bounded ele- 
ments are those numbers a that do not admit an approximation by 
rational fractions better than in accordance with the law 


zg (61) 


(We note that among these numbers are all quadratic irrationals.) We 
sec now that all these numbers form a set of measure zero. In other 
words, a/most all numbers (that is, all but a set of measure zero) admit 
a best approximation by rational fractions. Evidently, then, the basic 
problem of the measure theory of approximation is the question of de- 
termining the measure of the set of numbers admitting some specified 
degree of approximation by rational fractions. In particular, what is 
the best law of approximation admitted by almost all (see above) num- 
bers? In other words, within what limits can the law given by in- 
cquality (61) be improved if we agree to neglect the set of numbers a 
which is of measure zero? We shall solve this problem in the next sec- 
tion. 
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THEOREM 30. Suppose that (n) is an arbitrary positive function with 
nalural argument n. The inequality 


a, = a, (2) > > T (72) (62) 


iS, g almost all a, satisfied by an infinite number of values of n if the 
series Sa 1/g(n) diverges. On the other hand, this inequality is, for al- 
mosi all a, satisfied by only a finile number of values of n if the series 
Tre 1/g(n) converges. 

PRELIMINARY REMARK. In particular, if we set the function (1) 
equal to a constant positive number M, we conclude from Theorem 30 
that the set Ex, which we used in proving Theorem 29, is of measure 
zero. Thus, Theorem 29 can be regarded as one of the simplest cases 
of Theorem 30. 

ProoF. The first assertion of the theorem is proved in a manner com- 
pletely analogous to that used in the proof of Theorem 29. Suppose 
that Jm+n is an interval of rank m + n at all of whose points 


amni < (m +i) (i=l, 2,..., N). (63) 


(We shall not impose any conditions on a, &2, *°** , Gm.) Keeping the 
notation that we used in the proof of Theorem 29, we obtain the in- 
equality, analogous to inequality (59), 


a J” I 
Me > Jm+n+1 < fı EEN ETEEN 
k<¢(m+n4l) 


When we sum this inequality over all intervals of rank m + n that 
satisfy conditions (63) (denoting the set of all numbers of the inter- 
val (0, 1) satisfying these conditions by Em, n), we obtain 


l 
MEm an <{! appa aF] Ea 


Successive application of this inequality gives 


n 


DEn, a < DEn JT Sa arl 


If the series Ei, 1/g(n) diverges, the series 


l 
= 3(1+o(m+1)) 
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will obviously, for arbitrary 7, also diverge. From the theory of infinite 
products, it follows that the product 


Uere 


t= 


approaches zero as 7 —> ©. Thus, we have, for arbitrary m, 
ME 7 >O as n>. 


But every number a for which 


amet < (m-i) (=l, 2: ees 
obviously belongs to all the sets 
|EN OSR 125 ah 


therefore, the set of all these numbers, which we shall denote by Em, 
must be of measure zero. Finally, if we set 


E HE+... HE +4... =E, 


we sce that ME = 0. But every number a for which inequality (62) 
is satisfied only a finite number of times must, obviously, for sufficient- 
ly large m, belong to the set Em and, hence, to the set E. This proves 
the first assertion of the theorem. 

Suppose now that the series Xiz; 1/¢(2) converges. Suppose that 
J, is one of the intervals of rank xz and that J{*), is an interval of rank 
n + 1 contained in J, and defined by the additional condition that 


dn41 = k. From the second inequality of (57), we have 


DI <M, 
so that 
o 
Pe > JP < DIM, `” + 


k> (n+!) k>9(n+)) 


. 
<2 SWEDE 


l du| AMI 
<an aay + J |= h 


¢(n+1) * 
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If we denote by F, the set of numbers of the interval (0, 1) for which 
a, Z g(x) and sum the inequality obtained over all intervals J, of 
rank z, we see that 


4 
Fan < SG ED’ 


since IMJ, = 1. Thus, the measures of the sets Fi, Fo, +++ , F,, ++: 
form a convergent series. Denoting by F the set of those numbers in 
the interval (0, 1) that belong to an infinite number of sets Fn, we 
then have? 


MF = 0; 


But the set F is, of course, just the set of numbers for which the in- 
equality (62) is satisfied for an infinite number of values of x». This 
proves the second assertion of the theorem. 


14. Measure-theoretic evaluation of the increase in the 
denominators of the convergents. The fundamental 
theorem of the measure theory of approximation 


THEOREM 31. There exists an absolutely positive constant B such that al- 
most everywhere, for sufficiently large n, 


In = p (4) < e". 


PRELIMINARY REMARK. We saw in section + of Chapter I (Theor. 12) 
that the denominators g, for all numbers a increase with increasing x 
no more slowly than some geometric progression with absolutely con- 
stant ratio. Theorem 31 asserts that, for almost all a, the numbers g» do 
not increase faster than some other geometric progression, also with 
absolutely constant ratio. This situation can be expressed in a differ- 
ent way: there exist two absolute constants a and -1 (where 1 < a < -1) 
such that, for almost all numbers a in the interval (0, 1), for sufficient- 
ly large 7, 


a<Vq,<A. 


2 This is a well-known theorem in measure theory. However, here is the prool: 
Obviously, the set F is, for arbitrary m, contained in the set SP _ml'n; the measure 
of the latter set does not exceed Spa mMFa and, consequently, for sulliciently large 
m, it may be made arbitrarily small. 
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In fact, there is a considerably stronger proposition: there exists an 
absolute constant y such that almost everywhere 


n 

VIa >1 (n — 00); 
however, the proof of this theorem is considerably more complicated 
and requires certain more powerful tools, which we shall discuss in 
sections 15 and 16. Unfortunately, the framework of the present book 
does not allow inclusion of this proof. On the other hand, for our im- 
mediate purpose, which is to establish Theorem 32, the property of 
the numbers g, referred to in Theorem 31 is quite sufficient. 

PRooF. We denote by £, (g) (n > 0, g > 1) the set of numbers in the 
interval (0, 1) for which 
Ajg e. a, >. 


Obviously, this set represents a system of intervals of rank n. The 
length of any one of these intervals is, as we know from section 12, 
equal to 

Pa Pa ae Pr-| l l 


—_ — =e a ae 
Gn lata- 1a (a Fia- as a Gy +++ Gd,) 


since successive application of the obvious peel 


Vn > aain- 
gives 
In > Anln-1 «+. Aali. 
Therefore, 
ME sos es 
(D< 2 ame 64) 


a,a, -a8 1,28 


where the summation is taken over all combinations of natural num- 
bers 4&1, @2,°*+,@, that satisfy the inequality ajas +++ an > g. To 
evaluate this sum, we note that 


4 n 
ie T ] (1+ Pe 1 : ] 
a ees 9 ol 
izl ti ai (a + D) S < JI a;(a;+ 1) 
n @,+i1a,+1 at) 
== "TI ie e dx, dX... dx 
= i fd es 2 J f es . . “ee a i 
i=l a; t a, a; a, 1*9 «6. i 


* Proof of the above statement was obtained by Khinchin in 1935. See “Zur 
metrischen Kettenbruchtheorie,” Com positio Mathematica, 3, No. 2, 275-285 (1936). 
Soon afterward, the French mathematician P. Lévy fond an explicit expression 
for the constant y, namely, In y = x?/(12 In 2) (see P. Lévy, Théorie de laddition 
des variables aléatoires, Paris, 1937, p. 320). (B. G.) 
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X IT TEO 
>g (iar ! 


a2, aoe an 


and, consequently, 


where J,(g) is the #th-order integral 


se fg eae 


over the region 


XXa -e Xp > B. 


For g < 1, this region is obviously the region 1 < x;< œ (for 
a= 1,2, -++ , 2), and we obtain 
cD 


„o= f£] =] (e < 1). (65) 
1 


Let us now show that, for g > 1, 


n=l 
1 
Ja (8) = g `” Un 
i=0 


For n = 1, this equation is of the form 
co 


dx 1 
x2 Er 





(66) 


and hence is truc. Assuming it is true for n = k, we have 


co 


g 
dx l 
J = | Sees E Aa fs d 
p+1 (8) . ca e ) g, p (4) du 


Akl 


l g 
1 
-1| nwu J nora} 


If, in the first integral, we substitute the value of J;.(#) given by for- 
mula (65) and, in the second, that given by formula (66) (which we 
are assuming established for n = k, g => 1), we obtain 


R . 
] l4 1 {] yí 
eft + See lee ae 
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which proves the assertion. Thus, 


n-1l 
2” (Ingy 
S i=0 


In particular, if we set g = e^”, where A > 1 ìs a constant, we have 


ME, (8) < 


n-l : 
An)! 
ME, (e4) < er tin2—a) YEW 
l=0 
It is casy to sec that in this sum each term is less than 
(An)" , 
n! ' 





therefore, if we use Stirling’s formula for approximating the factorial, 
we obtain 


ME, (247) < er (in 2- Ay ST (Any 


n An)" 


< C g" (In2- A) = 
l nh ena Vn 


< C, V ne-2(A-m Ann 2-9), 


where Cı and Ce are absolute constants. 
But if A is sufficiently large, 


A—nA—In2—1>0, 


and, consequently IR#,(e4”") is less than the nth term of some con- 
vergent series. Since the series 


2 ME, (e4”) 


converges, every number in the interval (0, 1), with the exception of 
a set of measure zcro, belongs to only a finite number of the sets 
i,,(e"). This means that for almost all numbers in the interval (0, 1), 
we must have, for sufficiently large 7, 
ails... ap L E^; 
also, since 
In = 2nIn-1 = i Gn-2 < 24 n9n-1 
and, consequently, 
n 
E OA E EE 
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it follows that almost everywhere, for sufficiently large 1, 
In < DigAn — eBn 


where B = «! + In 2. This completes the proof of Theorem 31. 

This result, which in itself is of considerable interest, is especially 
important for us at the moment, since we can use it to obtain a simple 
solution to the basic problem of the measure theory of approximation. 

THEOREM 32. Suppose that JG) isa positive con-inuous function of a 
positive variable x and that xf(x) is a non-increasing function. Then, the 
inequality 


Ja — — 2] < (67) 


has, for almost all a, an infinite number of solutions in integers p and q 
(with q > 0) if, for some positive c, the integral 


f fax (68) 


diverges. On the other hand, inequality (67) has, for almost all a, only a 
finite number of solutions in integers p and q (with q > 0) tf the integral 
(68) converges. 

PRELIMINARY REMARK. In particular, on the basis of Theorem 32, 
the inequality 


|2— AT 


has, almost everywhere, an infinite number of solutions. On the other 
hand, the inequality 


has, for every constant e > 0, almost everywhere, only a finite num- 
ber of solutions. From these facts, we can get an approximate idea of 
what changes to expect in the general law of approximation if we agree 
to neglect a set of measure zero. 

Proor. Part 1. Suppose that integral (68) diverges. Let us define 


o (x) = eB f (e9*), 
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where B is the constant referred to in Theorem 31. Then, the integral 
A BA 


fecacaz f faau, 
Ba 


a 


where -1 > a > 0, increases without bound as A — o. Since the func- 
tion ¢(x) is, by hypothesis, non-increasing, the series 


co 
sd 
Èg (1) 
nal 
diverges. On the basis of Theorem 30, we now conclude that, almost 
everywhere, the inequality 
l 
ait > TR 
i+] Eca o (2) 
is satished for an infinite set of values of 7. But when this inequality 
is satished, 
Pi l l (i) 
7,|< <—z < $F. 
I Glis alli li 








(69) 


On the basis of Theorem 31, we have, almost everywhere, for sufficient- 
ly large 2, 


q; < ent. 
so that 
i> Le 


Therefore, inequality (69) almost everywhere implies the inequality 


nj) ro, 


a — — 
q; qj q; 





for sufficiently large 7. This inequality is satisfied almost everywhere 
for an infinite set of values of 7. This proves the first assertion. 
Part 2. Let us now suppose that integral (68), and hence the series, 


2S, 
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converges. Let us denote by E, the set of numbers a in the interval 
(0, 1) that, for a suitably chosen integer k, satisfy the inequality 


k- 4] <A, 


(Obviously, the set £, consists of the set of intervals of length 2/[n]/n, 
with centers at the points 1/1, 2/2, +++ , [n — 1]/n and of the inter- 
vals (0, f[n]/n} and {1 — flz]/n, 1}.) We then have 


ME, < 2f (n). 


(The symbol < holds if ffn] > 4.) Thus, the series 


converges. We conclude from this, just as we have done on previous 
occasions, that almost every number a in the interval (0, 1) can belong 


to only a finite number of sets En. This means that almost all the num- 
bers a in the interval (0, 1) satisfy the inequality 


peau 


for a sufficiently large positive integer q and for an arbitrary integer p. 
This proves the second assertion of the theorem. 

In the next section, we shall learn a method that makes it possible 
to solve much more profound problems in the measure theory of con- 
tinued fractions. 


15. Gauss’s problem and Kuz’min’s theorem 


The problem that we are about to discuss was, historically, the first 
problem in the measure theory of continued fractions. This problem, 
posed by Gauss, was not solved until 1928.4 
Setting, as usual 
O07 Bu Ope ase wees 


I n = Tan (2) — [a,; Aati’ Onto s.’ 1, 


4See R. O. Kuz’min, “Ob odnoi zadache Gaussa” (a problem of Gauss), Doklady 
akad. nauk, ser. A, 375-380 (1928). Another solution was published in the article 
by P. Lévy, “Sur les lois de probabilité dont dependent les quotients complets ct 
incomplets d'une fraction continue,” Bull. Soc. Math., 57, 178-194 (1929). (B. G.) 
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we denote by Sn = Sala) the value of the continued fraction 
[03 anir Fain +++), 


that is, we sel 


Obviously, we always have 


Om ží: 


We denote by m(x) the measure of the set of numbers a in the inter- 
val (0, 1) for which 
Zz, (2) << x. 


In one of his letters to Laplace, Gauss stated that he had succeeded 
in proving a theorem that implied that 


: I+x 
lim m, (x) = aL) O<x< bt. 
n> œ In 
In his letter, he indicated that it would be very desirable to obtain an 
estimate for the difference 


In(1 + x) 


m,(x) Ta In 2 (70) 


for large values of z, but that he had been unable to do so. Apparently, 
Gauss’ proof was never published, nor were other proofs of his assertion 
known before 1928, when Kuz’min published his proof and gave a good 
estimate for the difference (70). The present section is devoted to an 
exposition of these results and of certain generalizations of them that 
we shall need later? 

It was already known to Gauss that the sequence of functions 


m(x), m(x), M(X), ..., MaX) ... 
satisfies the functional equation 


oO 


Mrs (4) = 2 {m (7) (yrr) O0<*< l,n>0), (71) 


* Like Gauss, Kuz’min formulated the results in probability-theory terms, which, 
of course, does not affect their content from the standpoint of measure. 
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To show this, note that, on the basis of the obvious relationship 


l 
L,———______ 
a an+1 F Zn+ı ; 
the inequality 


Zn LX 


is satisficd if, and only if, for a suitably chosen positive integer k, 


l l 
EF SaS E 


Since the measure of the set of numbers satisfying this inequality is 
obviously 


nin (@)— mhr) 


relationship (71) holds. 
It can easily be verified directly that the function 


9 (x)= C In(1 + x) 


satisfies the equation 
co 


= Bee) —*(ezz)} 


for an arbitrary constant C, which probably helped Gauss in finding 
the proper expression for the limit of the function m,,(x) as n @. 
Formal differentiation of equation (71) gives 


mri Dy aaa (Fx): (72) 


The validity of (72) can easily be shown in a rigorous manner. Since 
obviously zo(a) = a, we have mo (x) = x, and hence m’o(x) = 1. If the 
function m’,(«) is, in general, bounded and continuous for some x, the 
series on the right side of (72) converges uniformly in the interval 
(0, 1). The sum of this series is therefore bounded, continuous, and 
equal to m’,4,(x) (due to the well-known theorem on termwise differen- 
tiation of series). Thus, (72) is proved inductively. 

Equation (72) is much more convenient for making investigations 
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than is equation (71). Kuz’min’s basic result, which we shall now prove, 
has to do with this relationship. 

THEOREM 33. Suppose that fi(x), Jax), s+ , faa), +++ is a Sequence 
of real functions defined on the interval (0, 1) that, on that interval, satisfy 
the relationship 


fan (= >», T Ín (=) (n > 0). (73) 
k=l 


lj, jor 0 SA's 1, 


0 < falx) < M 

and ; 

| Fo] <p, 
then, 

ie = gF tlena 0<x< 1l), 

where 

l 

l L 
a= | hdz, |6,< 1, 


0 
Nis an absolute positive constant, and «l is a posilive constant depending 
only on M and u. 
The proof is complicated, and therefore we shall first give several 
elementary lemmas. 
Lema 1. For arbitrary n = 0, 


(n) 
NW ¢ ( Pat*Pn- l 
Ín a 2 fo( 2 + TPE ) (qn + XGn-1)? ' (74) 


where (pa/q, (Pn + Pa—1)/ (Gn + Qn—1)) is an arbitrary interval of rank n 
and the summation takes place over all intervals of rank n (or, what is the 
same thing, over the elements ay, as, °+* , an from 1 to œ). 

Proor. For z = 0, (74) is trivial, because in that case there is a 
unique interval (0, 1) for which po = 0, go = 1, p-1 = 1, and g_1 = Q. 
Assuming now that equation (74) is valid for some 7, we have, on the 
basis of the fundamental equation (73), 


foi O= È apap llars) 
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7 = i a Pat kx Pn-\ l 
T kad (RX)? kad A 
k=1 ) n + ki Jn—~ı (an+ Rox an-) 
(n) œ 
—Ş i (Pank + Pn-1) + *Pn l 
dad ad Ol (nk + gn-1) H *gn | nk +ga- + Gn}? 
(n +1) 
E Yy f (Zeit Pa) l 
wd ` À n+ H Xin) (nti F Xan) ' 


which proves the lemma. 
LEMMA 2. Under the conditions of Theorem 33, for n > 0, 


F CO] say HAM. 


On 
PROOF. If we differentiate (74) termwise, we obtain 
(n) (n) 


yar} 
fwo=)> fi) oo — 2D fat Yn—! 











+ XGn- 1)" (Qn F Xin) ' 
where 
p= PatXPn-1 
Gn + XGn-1 


The validity of termwise differentiation follows from the uniform con- 
vergence of both sums on the right side for 0 < x < 1. We note that 
se 
Un F rga- Sn Qn Gn) | 


Then, on the basis of Theorem 12 of Chapter I, 
Ia (la Fani) > R es 


and, in view of the obvious relationship 
(n) 


(n) 
1 
» q (a + Fn-1) z 2 


we have, because of the conditions of Theorem 33, 


AE < Gor + 4M, 


Pn PatPn-1 


Pa _ PaT a Best | ], 


Gn Gn + Qn-=1 





which was to be proved. 
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Lema 3. If 


t T 
Tae OS ee 


0<x<!), 
il follows thal 
E < fan <a (O< x <1). 
Proor. Under the conditions of the lemma, the fundamental equa- 
tion (73) gives 
V gee 5 l 
1 = CEST < Savi) < 2 — RF 
im Pa k+x Tele + x 
or 


l a l 
onl (k+ x (k++!) < fns) < TY, (2 + x) ( 





k+x+1)’ 
or, equivalently, 
ae 
Sea FEF) < fan) < P epee i): 
or, finally, 
t f see 
1+ x < Jari) < l+’ 
which was to be proved. 
LEMMA 4. 
1 1 
f fa dz= f fo@)dz =N E 2ra) 
U 0 


Proor: Because of the fundamental equation (73) (for n > 0), 


fi, @a=> ft exe) e Far 


k=10 


R+1 


; feasted f fa-yCe)ae, 
k=1 1 0 


so that the lemma follows by induction 
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Proor OF THEOREM 33. The function fo(x) is, by hypothesis, differ- 
entiable and hence continuous for 0 < x < 1; since it is, again by 
hypothesis, positive in that interval, it must have some positive mini- 
mum, which we shall denote by m. From the condition m < fo(x) < M 
(for 0 < x < 1), we obtain 


2M 
sign ThT (0< « <1), 


or 
G 
The <Ir 0<), 
where 
g=5> G=2M 
We now define 
fa) Tir = anA O<xdi, n=0, 1, 2,...). 


On the basis of Lemma 3, the function F(x) = g/(1 + x) satisfies the 
equation 


ron Sr(ealete 


(which can easily be shown by direct verification). From this, it obvi- 
ously follows that the sequence of functions 


Co (2s Cy Or eon Op Oe he 


satisfies (73). Therefore, all the conclusions that we have deduced from 
that equation, in particular (74), are valid. As before, we set 


— PatxPn- 


u = 
n F XGn-1 


for brevity. Thus, we have 
(n) 


z l 
G(X) = 2, (U) -—, 
a= A 
so that, on the basis of the obvious inequalities 


dn F Xan- < n FIn- < 2q n and Zo (u) > 0, 
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we obtain 
(n) 


Fn >F 2 D Fo (u) Gn (In 7 Jn-1) ` (75) 


On the other hand, the mean value theorem gives us 


l (n) 
Ir 1... l 
zj A (Op 2 > Fo (4) Gn (Qn + 4n-1) ’ (£9) 


where x’ is one of the points of the interval (pn/qn, (pn + Pn—1)/ 

(gn + gn—1)), and 1/[gn(gn + Gn—r)] is the length of this interval. Re- 

lations (75) and (76) give us 
1 


(n) 
l 1 1 
G(X) — > Ja (z)dz >> À lo — au) PET E 
0 
But since, obviously, (77) 


le(Ml<lf()lte<pte O<!) 
it follows that 





be Ae f nad —— / ee a 
| So (4) — ego (U) | < (t+ g)lu—u’l< Gn (Gn + qai) 
+g T 
< ma < TA ” 


Inequality (77) then gives us 
1 


Ga) > 5 f: 
0 


l 
I= f odz. 
0 








e+e, ute 
a aan 


n 
where 2 


Thus, we obtain 


J. (x) > Sey ae Hrs > apie ree +g) — £! 


or l+x 1+x 


If we examine the sequence of functions 


(= pee—Sa(*) — (W#=0, 1,2...) 
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and follow the same line of reasoning as before, we obtain the inequality 


— F -n1 
ioc a sto =. 


where : 
r=5 | poaz. 
0 


Since / > 0 and /’ > 0, we have, for sufficiently large », 


gL <a 


and 


G — 8 <0 —g— (I+) H2"? (u40). 


Also, since 





1 
„ifa In 2 
I+ = zf oy dz=(G—g)->, 


we have 
Gi— 8, <(G— g)? 42777? (44+ G), 
where 
ô = | — m2 <i 


is an absolute positive constant. 
Let us summarize the results that we have obtained. From the con- 
ditions that 


= z Lf) <p ey. | Fok) |< pes 


we have shown that, for sufficiently large n, 





rey <i <TH 


where 


g<g <0 <0, Gg < (0—8) +27" 49). 
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Ii we now start with the function fa(x) instead of fo(~) and repeat the 
line of reasoning that we have used, we obtain 


and 


B1< Be << a2 < G,, 
Gy — g: < è (Gi — g) +277 4G), 
where u is a positive number such that 


O< OS <I). 


If we carry this process further, we obtain, in general, 





| 7 G 
mera aO O0<x<l; r=0, 1, 2,...), 


and, for r > 0, 
8r-1 < Er < G, < Gris (78) 
-n+2 
cee ae ee a) ee a 


where u, is a positive number such that 


(Traa) < tr- (O<Cx< 1). 
On the basis of Lemma 2, we can set 
u, = —— a = + 4M (r =0, 1, 2, ...), 


and, consequently, if z is chosen sufficiently large, 
t, < 5M CSN esh 

ee successive application of inequality (78) gives us (forr = 1, 
2 e in 

Ga — En < (G — g) ù 427977 (u 42M) èr! 

+7Me"-24-7Mo"-34 ... +7Mi+7M}. 
Since 6 < 1 is an absolute constant, it obviously follows that 
Gn — 8n < Be-™, 
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where A > 0 is an absolute constant and B > 0 depends only on M 
and u. 
From this, it follows, first of all, that there is a common limit 


lim Gi = lim g&n =a 
n> oO A> n 


and that 
a A 
| fr (a l+x | < Be- (0 <x <1), (79) 


so that, in particular, 
1 
f fe @)dz—>ain2 (n —> œ), 
0 


and, consequently, on the basis of Lemma 4, 


1 
axa | haz 
0 


Finally, suppose that 
WON <(a+1). 


Since, on the basis of inequality (79), 


a—2Be—" a+ 2Be— 


Tpx AL T 
it follows from Lemma 3 that 


2Be—>" 
< f(x) < EET, 


a —2Be™>" 
l+x 


or 





fn (x) — j a < 2Be~-M@=— Agathe Ae, 


where A = 2Be>. This inequality, which we have established for sui- 
ciently large X, is obviously true for all V > Oif we choose a sufficient- 
ly large constant .!. This completes the proof of Theorem 33. 

Let us now turn to Gauss’s problem. If we set 


fi =m a) O<x<)), 
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we obtain fo(v) = 1. Therefore, all the conditions of Theorem 33 are 
satisned. Thus, we obtain 


Pan? —-.Va 
|m, (x) — ram <4 OSS Ve O 


from which, by integrating, we obtain 


Ma (x) — BEE) | < Ae Va (O<x <1), 
where .1 and à are absolute positive constants. This not only proves 
Gauss’s assertion, but also gives a good approximation of the remainder 
terin.” 

Let us now use this result to obtain an approximation of the meas- 
ure of the set of points for which a, = k for sufficiently large values 
of z. Since, obviously, the condition an = k is equivalent to the in- 
equalities 


l l 
Fal ine 


it follows that 





l 
k 
n | f 
ME) = mi (F) malt 5j” a(x) ax. 
On the basis of inequality (80), it follows that 

| ni trey} 

mel? k (k + k (k +2) | A Vai 

|me(?)— In 2 Speen? Ce 


We now have a precise limiting relationship for the quantity me(7): 


° The method that Lévy used makes it possible to obtain a better approximation. 
lie showed that 
In(lt« 
My, (x) — aot) < Ae-)a (O< x <1). 


(B. G.) 
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for which we had only rather crude inequalities in section 13. Specifi- 
cally, 


(n —> œ). 


l 
me(;) “lapel 


Thus, for example, the measure of the set of points for which a, = 1 
approaches the quantity 


In2 


aS 11—> @, 

Besides proving Gauss’s assertion, Theorem 33 enables us to obtain 
a more general result, the importance of which will be shown below. 
We denote by M,(x) the measure of the set of numbers belonging to 
some fixed interval of rank k and satisfying the condition g;,, < x; in 
other words, M,„(x) is the measure of the set of numbers in the interval 
(0, 1) satisfying the condition 


a,=l), a = Fo, Ekai ap =T pg; rA i E (82) 


where 71, 72, °°* , 7: are certain fixed natural numbers and where 
n > Oand «(0 < x < 1) can be varied arbitrarily. 

For the conditions (82) to be satisfied, it is obviously necessary and 
sufficient that 


1 l 
a =T ü = T9, e. lp =T ps Ppa Srn- SF 


where 7 is some natural number. It then follows that 


mo= Sf aya(2) -M(H e> 0< x <0) 


r=] 


so that the sequence of functions M(x), Mix), e, M (x), ore 
satisfies equation (73). 

An arbitrary number a of the interval [px/ge, (pe + px—1)/ 
(gx + gx-1)] can be represented in the form 


ee Peon ay T Prai 
Venti t Ika 


8&4 CONTINUED FRACTIONS 


or, since 5; = 1 rest, 
Pp +2 pPp_} 
a = 2 EF" 
Tp Tkig- 
For 2. < x, the number a must lie between prq: and (p; + vps—1)/ 
(ge fb Vyr-1). Therefore, 


allp Hap”) 


Pe Ppt *Pg-\ x (83) 
P 1, F41 


My (x) = 





Ii we now set 
l, 2: oe ay k 


ry Poy t. .p lg 


M, (x) = WE ( n(x) (n>0, O< x <1), 


we obtain a new sequence of functions: 
AD Vi Coy eats TO), 225 


here, the functions x (x), which differ from the corresponding func- 
tions M’„(x) only by a constant factor, also satisfy (73). Since obviously 


me Dig. a A )= 
EE OE y 


Ppt Pp) |__ 1 


Pe SEE 
1» Ip yy 9, (9%, + Ip) 








equation (83) gives us 


(9, + 9,_1)* 


70 (x) = J -+ E. 


so that 

i 4, (1, +9,_4) 
L(x) = R\tk k 7 
(ap Fag”) 


and 


r 201r- Fip) 
x (Ie Fap) 
Thus, 


l , 5 n 
y yu) < 2 [Zo |< 4 O<x«< 1). 
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This shows that Theorem 33 can be applied to the sequence of func- 
tions x’,(v), where the numbers -1 and X will be arbitrary constants 
(in particular, independent of r1, 72, °°° , r). We thus obtain 


ee M’ (x) o l ET ti 
a) iel io. Cane [9< 1. 
Lirfa., 5) 


If we integrate this relation between the limits 1/(r + 1) and 1v, 
where vr is an arbitrary natural number, we obtain, for 0! < 1, 


l l l 
a E treet a yy, 
ME ( l, r In 2 r(r+1) 

Fy, fa.’ FR 


and since, obviously, 


m, (4)—m, ( l ) = me ea aoe, 


r r+ ae T a E oe 
we have 


mE ( | a AEE Aas 


fi Fay woes rp, r 


[nti rez] vA YT) me pe ‘i 


In 2 r(ir+1) yy fa.. FR 


Finally, we can sum this relationship from 1 to œ for certain of the 
numbers 71, 72, «°° , rą (arbitrarily chosen). As a result of this summa- 
tion, the terms with like subscripts disappear on both sides of the equa- 
tion, so that instead of a succession of subscripts 1, 2, «++ , k, we ob- 
tain a succession of completely arbitrary subscripts Hi, M2, *** , Me 
In all other ways, the equation remains unchanged. Thus, we obtain 
the following theorem. 

THEOREM 34. Two absolute positive constants A and N exist such that, 
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for ny Sng See <ni <L Hea and for arbitrary positive integers 
Kiera, ttt Ta f, 


Nyi Aa esos Np ) f 1 } 
yy ——————_a 
: mee (” ETEN fpr tae 
| me ( i ee I In 2 
i Fis Tas cen fi 
A -àA Vn -n,-!} 
{+1 t 
< r(r +1) d : i 


This result shows not only that the measure of the set of numbers 
in the interval (0, 1) for which a, = 7 approaches a definite limit as 
n — œ, but also that the relative measure of the set of numbers satis- 
fving this condition, given arbitrary fixed values of an arbitrary set 
of preceding elements, approaches the same limit. 


16. Average values’ 


The results of the preceding section enable us to prove the following 
general proposition. 

THEOREM 35. Suppose that f(r) is a non-negative function of a natural 
argument r and suppose that there exist positive constants C and 6 such 


that 
Via 


4 


JOLO (r=1, 2,...). 


Then, for all numbers in the interval (0, 1), with the exception of a set of 
measure sero, 


+ 
imt Y S D= Yio hitzera (84) 
n >œ k= rel 


PRELIMINARY REMARK. The convergence of the series on the right 
side of (84) follows, of course, from the condition imposed on the func- 
tion f(r). 

Proor. Let us define 

1 1 
f f (ap) d2 = lp, f [J (a,) — u,|? da = bp, 
0 


0 


‘The results of this section are to be found in Khinchin’s article “Metrische 
Kettenbruchprobleme,” Com posilio Mathematica, 1, 361-382 (1935). (B. G.) 
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1 


J (Fa) ~ ui (FG) — 45) da = Bip 
0 


A > a (S (a) — 44} = Sp = s, (2). 


The existence of all these integrals follows easily from the properties 
assumed for the function f(r). For since 

pi (r)}? < Cepl— 28 
it follows that 


l œ 
frana opme(*) <e 2c, 
0 r=l 


r=1 


is meaningful. The existence of all the integrals then follows on the 
basis of the Bunyakovskii-Schwarz inequality. In particular, it fol- 
lows that 
1 
2 
br = f (S (0p)? da — uh < Ch. 


0 


i w n (99) 
Up = f f(@,)az < Vj If (ap)? da < Ve 
0 0 
Furthermore, for k > t, we obviously have 
Ein = J Edf (a) da — uuy 
: (86) 


\’ i, R 
= > f(r) f(s) ME (7 A — U jp. 
r, S=1 
But, on the basis of Theorem 34 a the inequalities of section 12, 


aeg "Lape ae (i) 


oe i—l 
<-s GFI mE (;) ve 


< 3Ae-)V k-i- ENE jne ) 
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and 


In2 
Ae->V R-i 
< s(s+1) 


: k n{itseray | | 
WE (s)—-—j T 
< jaen N, 


If we multiply inequality (88) by WE ( ) and compare the resuit with 


inequality (87), we obtain 


| we ($ e — me ( \ne(f )| 
< 6Ae-AY R-i- ARE \ nie (' ), 


as a result of which (86) gives us 


8 in — X sosone(’ ye ($ )+ au, 


f, 5=1 








< 6Ae-} VETI- 5 f(r) f(y MEF Jane ($ J: 


r, S=1 
Noting that 


“| 8 


f(r) f(s) ME() MEF) = ayn, 


r, $=) 


and using the second inequality of (85) to estimate the right side of this 
cquation, we obtain 
From a andl (89), we have, for n > m > 0, 


fe, — sara f y d apup] as 
kR=m+l 
n 1 


— ` f SaD — u)? da 


k=m+1 0 
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n l (90) 
i > > f (S Ca) — u} (f (ap) — up) da 


i=m+1 k=i4+41 0 


= ba b,+ 2 > 2 Eie < Cy (n — m) 


k=m+1 i=m+l kai+l 


+1246, X) X eY < C,(n— m), 


i=m+1 k=i+1ì 


a 


where Cs is some new positive constant. 
We now denote by en the set of numbers in the interval (0, 1) for 


which 


|s, | > en, 


where e is an arbitrary given positive constant. Obviously, 


l 
0 


en 


so that inequality (90) (for m = 0) gives 


0 Ce 
Mer S age S Pn ' 


Thus, the series 


converges and, consequently, as we know, almost every number in the 
interval (0, 1) belongs to only a finite number of sets e,2, for n = 1, 2, 
3, ++» . This means that, for almost all numbers in the interval (0, 1) 


and for sufficiently large 7, 


So 
n e 
a LE 
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and since e is arbitrarily small, it follows that almost everywhere 


Sa 
lim —- = Q. (91) 


2 
A> oO n 


Furthermore, for n? < N < (n + 1)°, formula (90) gives 
1 
f (sv — sr) da <Cy(N — n?) < C, (20 + 1) < 3Cr. 
0 


If we denote by en, y the set of numbers in the interval (0, 1) for which 
Sy — Sẹ > en? and if we set 
(2+1)?—1 


2 ve 


i nr 
N=ñ* 


we then have, for n? < V < (n+ 1)’, 


1 
f (Sy — SmX da > f (Sy — Sn?) dz > e*ni Me, Ns 
0 


en, N 
3C 
Men N < aan 
ol 3C,(2n+1 
ME, < `” Men, N < aD < a 
N=’ 


so that the series Xie ME, converges. Almost every number in the 
interval (0, 1) must then belong to only a finite number of sets £, and, 
hence, to only a finite number of sets eny. But this means that almost 
all numbers in the interval (0, 1) satisfy the inequality 


| Sw — Sp] < en?, 
for sufficiently large #2 and for n? < V < (n + 1}. In other words, al- 


most everywhere 


| Sa — Sp | 
n? 


<E 
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for sufficiently large u and for n? < NV < (n + 1)°. Since e is arbi- 


trarily small, it follows that almost everywhere 


Sy S ya 0 9 5 
e r A [n —>œ, MCN < (n+ 17]. 


On the basis of equation (91), it then follows that almost everywhere 


2.0 [n>o0, n? <N < (n+ 1), 


and hence, a fortiori, 


SN 
wy 9 (N —> co). 


In other words, almost everywhere, 
N 
W >» f (a,)— =», u, >Q (N => œ). (92) 


Ara 
But from ‘sath a of the —— section 











r=] r=] 
E mY z 
> <AeVk-1 ES < Ape->YR, 
=1 





where .-1; is a new positive constant. Therefore, 
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Relation (92) then gives us 


1 
o> jy t i nit ree 


re] 


almost everywhere in the interval (0, 1). This completes the proof of 
Theorem 35. 

This theorem enables us to establish quite a number of properties of 
continued fractions that are satisfied for almost all irrational numbers. 
For example, let us set 


f()=1, for r=k and f(r)=0, for rk, 


where è is some (arbitrary) natural number. In this case, the sum 


Ya (k) = f(a) 


ial 


obviously represents the number of times the integer k occurs among 
the first z elements of a given continued fraction. On the other hand, 
the ratio 


gives us the density of the number k among the first z elements of the 
given continued fraction. Finally, the limit 


lim cat) © = á (k), 


noD 


if it exists, is naturally interpreted as the density of the number & in 
the entire sequence of elements of the given continued fraction. 

Since the function f(r) that we have defined clearly satisfies all of 
the requirements of Theorem 35, we conclude, on the basis of that the- 
orem, that, for arbitrary k, this density exists almost everywhere and thal 
tt has the same value almost everywhere. Furthermore, the same theorem 
makes it possible for us to calculate the value of that density. Obvious- 
ly, we have almost everywhere 





__In4—I1n3 _ In9—I1n8 ay Inl6—In15 
Bo e Cigar Eye gs 
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and so on. Thus, an arbitrary natural number occurs as an element in 
the expansion of almost all numbers with equal average frequency. 
We obtain another interesting result by setting 


f(r)=I1nr {a nee: Y 


All the conditions of Theorem 35 are then satisfied. Therefore, we see 
that almost everywhere 


e l 
LY na> Ym hitra] (n —> 00), 


or, equivalently, 


n. oo Jin? 
/aa, ... a.- Tj in 
ne > + sega 


Thus, the geometric mean of the first z elements approaches the abso- 
lute constant 


JI l+) E ARO ipg 


r=] 


almost everywhere as 2n — œ. 

Obviously, Theorem 35 makes it possible to establislr analogous re- 
sults for a whole series of other types of average values. However, in- 
vestigation of the arithmetic mean 


iS (93) 


i=l 


by this method is impossible, because the corresponding function 
f(r) = r does not satisfy the conditions of Theorem 35. However, it 
is easy to see from more elementary considerations that, almost every- 
where, the expression (93) cannot have any kind of finite limit. For 
Theorem 30 (sec. 13) tells us that almost everywhere 


a, >ninn 
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for an infinite number of values of n, and hence, a fortiori, 


n n 


Yay n ln n, and hence, lY a>inn. 


i=l i=l 


Thus, the quantity (93) is almost everywhere unbounded and there- 
fore, as we stated, cannot have a finite limit. 
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